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A NEW ROTATIONAL METHOD IN FACTOR ANALYSIS 


L. L. THURSTONE 
The University of Chicago 


A new rotational method, particularly appropriate when a posi- 
tive manifold can be assumed, is presented in theory and in compu- 
tational detail. It is applied to a five-dimensional factorial matrix 
in which a simple configuration is known to exist. The new tech- 
nique is much more efficient in yielding the simple configuration 
than are the methods formerly used. Landahl’s transformations can 
be combined with this method to advantage. 


Since the publication of The Vectors of Mind, several new pro- 
cedures have been tried to shorten the computational work of isolat- 
ing a simple configuration in a test battery. Some of these procedures 
have proved to be successful in a number of factorial studies, and it 
now seems desirable to make them available by publication. Since the 
new method constitutes a very marked improvement on the older 
methods, we have entirely abandoned the longer methods that have 
been previously described. The present method is especially well 
suited to problems with psychological tests in which a positive mani- 
fold can be assumed. This assumption is independent of the existence 
of a simple configuration. In addition to a description of the new 
method as regards its theory, we shall give here also a complete nu- 
merical example in sufficient detail to serve as a model for the com- 
putational work. The new method gives the same solution as the older 
methods, but it is much more efficient in finding the simple configura- 
tion if it exists in the test battery. Eventually even the present meth- 
od will seem cumbersome with further improvements in technique. 

The new method consists, in effect, in taking three-dimensional 
sections of the configuration instead of two-dimensional sections as 
before.* But each three-dimensional section is represented in the 
plane of a drawing, so that solid models are not necessary. With the 
new rotational method a four-dimensional problem collapses immedi- 
ately to the simple configuration. We have, therefore, chosen a five- 
dimensional problem for the numerical example so that several rota- 
tions are necessary to reach the solution. 

It will be assumed here that the correlational matrix has been 
factored by the centroid method} or by an equivalent method. Our 


* Thurstone, L. L., Primary Mental Abilities, p. 73, The University of Chi- 
cago Press, 1938. 

7 Thurstone, L. L., The Vectors of Mind, Chapter III, Chicago: The Univer- 
sity of Chicago Press, 1985. 
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problem starts, then, with the rotation of the coordinate axes of the 
centroid matrix. 

In determining the rotations of the reference axes in the older 
method we plotted on cross section paper every pair of columns of 
the centroid factorial matrix. A radial streak or band of points on 
such a diagram indicated the rotation of the axes so as to maximize 
the number of nearly vanishing projections of the test vectors. It was 
necessary that the band of points be radial. The new method allows 
one to draw a line through any straight band of points on the dia- 
gram, radial or non-radial, and to determine the new reference vector 
so that all points on that line have zero projections on the new refer- 
ence vector. A logical exposition would give the theory first and the 
numerical example later, but it will probably be clearer to present 
them together. 

A fictitious five-dimensional factorial matrix was constructed as 
shown in Table 1. It represents a positive simple structure as may 
be expected in a battery of psychological tests. There are 25 fictitious 
tests. The first test calls for only two primary factors, namely 2 and 
5, in that only these two entries in the first row are non-vanishing. 
Each row can be interpreted in the same manner. The number of 
zero entries in this table is such that the battery shows a simple struc- 
ture. The axes implied in Table 1 were rotated by an arbitrary or- 
thogonal transformation A, as shown in Table 2. This transformation 
was so adjusted that the first axis contained the centroid of the con- 
figuration. After rotation the configuration is described in terms of 
a new arbitrary orthogonal reference frame. The new factorial ma- 
trix is shown in Table 3. We shall assume that this factorial matrix 
is given, and that it is our problem to discover the simple configura- 
tion by a method that will produce the matrix we started with, name- 
ly, Table 1. It is as if we were asked to rotate the axes of Table 3 
without knowing anything about the original in Table 1. Table 3 rep- 
resents a centroid matrix obtained from the intertest correlations. 

The new method consists of several distinct operations that can 
be denoted as follows: 

1) The extension of each test vector so that its projection on 
the centroid axis is unity. The termini of all the test vectors then lie 
in a hyperplane orthogonal to the centroid axis and at unit distance 
from the origin. The matrix of direction numbers of the extended 
test vectors will be called the extended factorial matrix. 

2) An arbitrary orthogonal rotation of the reference vectors so 
that they all make the same angle with the centroid axis. This arbi- 
trary rotation of the reference axes was suggested by Mr. Herbert 
Landahl, and he has prepared a set of orthogonal transformations to 
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bring all the axes to the same angle with the centroid axis. Such ro- 
tations are not unique, but he has prepared a computationally con- 
venient orthogonal transformation for each order. These will be re- 
ferred to as Landahl’s transformation.* The method can be used 
without Landahl’s transformations, but it is more effective if these 
transformations are used. 

3) The pairs of columns of the extended factorial matrix are 


plotted as formerly. 








FIGURE 1 


4) New reference axes are chosen by these graphs. The process 
in steps (3) and (4) is repeated until no further rotations are evident. 

5) The vectors are reduced to their original length and a fac- 
torial matrix prepared. The columns of this factorial matrix repre- 
sent the primary abilities. 

In Figure 1 several test vectors are represented. The axis / is 
the first centroid axis. The extensions of the test vectors are repre- 
sented by dotted lines. They are extended so that they all have unit 
projection on the first centroid axis. The vertical line represents the 
hyperplane that contains the termini of all the test vectors. This hy- 
perplane is orthogonal to the centroid axis 7, and it is at unit distance 
from the origin. 


* Landahl, H. D., “Centroid Orthogonal Transformations,’ Psychometrika, 
current issue. 





202 PSYCHOMETRIKA 


III 











FIGURE 2 


The extended factorial matrix E,, is shown in Table 4. It is ob- 
tained from F,. in Table 3 by multiplying each row by 1/2, where x 
is the first entry in the row. These reciprocals are shown in column 
D. of Table 3. Then the first entry becomes unity in each row of E, in 
Table 4. The same multiplier applies to all the entries in a row of F.. 

In Figure 2 let P,, P2, P;, represent primary trait vectors, and 
let the small circles represent test vectors, lying inside the unit sphere, 
that define a simple configuration in the spherical triangle. The ex- 
ample refers to three dimensions, but these principles are general. 
Let I be the first centroid axis which is necessarily inside the con- 
figuration of P,P.P;. Let IJ and III be the other two reference axes 
in the orthogonal reference frame for the centroid factorial matrix. 
If we are to look down into the configuration, as it were, along the 
first centroid axis, it will facilitate the appearance of bounding coor- 
dinate hyperplanes if the reference frame I, II, III, can be moved so 
as to enclose the configuration as far as possible. Landahl’s trans- 
formation rotates the reference frame I, II, III, to an arbitrary posi- 
tion ABC which is so chosen that the centroid axis I of the configura- 
tion makes equal angles with A, B, and C. The problem is then to 
rotate the reference frame from I, II, III, or preferably from A, B, C, 
to the unique position P,P.P;,; which defines the simple structure in 
the test battery. 

The matrix EF, of Table 4 is postmultiplied by the Landahl matrix 
M, in Table 5 in order to obtain the extended matrix E, in Table 
5. This table shows a convenient arrangement of the computations. 
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There are several minor modifications which a computer will intro- 
duce, including check columns and rows which are not shown in this 
numerical example. The work has been shown here so as to be as 
clear as possible as regards the principal operations. 

The computations thus far are summarized as follows: 

1. The given centroid matrix F, in Table 3 is written in the 
extended form E, of Table 4. This operation can be represented for- 
mally in the matrix equation, 


DF. = E-, (1) 


where D, is a diagonal matrix of order n * n whose entries are the 
reciprocals of the first entries in the corresponding rows of F.. In 
practice one does not write this diagonal matrix. One merely per- 
forms the multiplication for each row of F,. 

2. The matrix E, of Table 4 is multiplied by Landahl’s matrix 
M, to produce the extended factorial matrix FE, of Table 5. This can 
be represented formally in the equation 


E.M, = E,. (2) 


The matrix EH, has five columns in addition to the first column 
with entries of unity. These five columns give ten possible pairs. Each 
pair of columns is plotted on a separate graph. Four of these are 
shown in Figure 3. The purpose of the graphs is to find, by several 
adjustments, the bounding coordinate hyperplanes. On the graphs we 
look for linear arrangements of points. 

In the graph for columns C and D a line has been drawn through 
or near fourteen points. It has been denoted the D plane because the 
line is more nearly orthogonal to the axis D than to C. The naming 
of the planes is of no consequence except when the adjustments are 
small in the later trials. Then it is well to retain the identity of each 
plane in its notation. 

We write the equation of the line in the CD graph. This is done 
by inspection of the intercepts and the slope or in any other conveni- 
ent way. For this graph the equation can be written by inspection in 
the form 


.68C + 1.00D — .26=0. (3) 


These coefficients are written in the column D of the matrix S, in 
Table 5. Consider a vector which is a linear combination of the vec- 
tors C and D and the centroid vector J, namely, 


Ap = .68C + 1.00D — .26/ . (4) 
The coefficients of (4) are identical with those of (3) except that the 
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numerical term is taken as the coefficient of the vector J. The projec- 
tions of all the test vectors on the centroid vector are now unity. 
Equation (4) represents a new reference vector Ap. All of the four- 
teen points on or near the line on the graph CD will have vanishing 
or nearly vanishing projections on the new reference vector Ap. Fur- 
thermore, if the line on the graph is bounding so that all the points 
lie in or on one side of the line, then the projections on the new 
vector Ap will be like-signed. Points on opposite sides of a line in the 
graph will have projections of unlike signs. Since we are looking for 
a positive orthogonal manifold, we choose trial lines on the graphs 
that bound the configuration of points. 

If the signs of the coefficients in (4) are reversed, the effect will 
be to define Ap in the opposite direction. The absolute values of all 
the projections will remain the same, but they will have reversed 
signs. It is convenient to know that if the coefficients in (4) are writ- 
ten with the numerical term in (3) positive, then the points in the 
graph on the same side of the line with the origin will have positive 
projections, and points on the other side of the line will have negative 
projections. Inspection of the graph CD shows that the points that 
are not in or near the line are on the opposite side from the origin. 
Hence the coefficient of the vector J is written with negative sign. 
This has been done in column D of S, in Table 5. If the origin had 
been on the same side with the points that are not on the line in CD, 
the coefficients of (4) would have been reversed in sign so as to be 
—.68, —1.00, and +.26. There is no harm if this matter of signs is 
ignored because the whole column D in the next factorial matrix is 
then negative. When a column of negative signs is discovered, one 
has merely to reverse the signs of the direction cosines of the corre- 
sponding reference vector to make all of the projections positive. 

In the graph AD we have drawn a line close to twelve or more 
points. It is about equally separated from the first reference vectors 
A and D. 

It is arbitrarily denoted the A plane. There is going to be only 
one noticeably negative projection on this reference vector A, . 

The equation of the line may be written 


1.00A + 1.03D —1.50=0. (5) 


In this graph the origin lies on the same side of the line as the ma- 
jority of the points. Hence the equation is written with positive con- 
stant term, namely, 


—1.00A — 1.08D +- 1.50 =0, (6) 
and then the new reference vector A, will be 


A, = —1.00A — 1.08D + 1.507. (7) 
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These coefficients are entered in column A of S, in Table 5. 

In the graph BD a line has been drawn so as to bound the con- 
figuration and so as to lie near a large number of points. Its equation 
is written in terms of the variables B and D with a numerical term. 
The latter is taken as the coefficient of the vector J, and the three co- 
efficients are written in column B of S, in Table 5. 

The graph CE shows two possibilities, and hence two lines are 
drawn on this graph so as to bound the configuration as far as pos- 
sible and so as to lie near a large number of points. These are de- 
noted the C plane and the E plane. Their equations are written, and 
the numerical term is in each case taken as the coefficient of the vec- 
tor I. The coefficients are written in the columns C and E in S, of 
Table 5. With a little practice this operation of filling in the entries 
in the matrix S, becomes easily routinized. 

The columns of S, show the direction numbers of vectors that 
are collinear with the new unit reference vectors. The length of each 
of the five column vectors in S, must be determined. But the column 
vectors in S, are usually not orthogonal. Their lengths cannot be de- 
termined by the sum of the squares of the entries in each column of 
S, because the reference frame is not orthogonal. These vectors are, 
therefore, referred to the orthogonal centroid frame of E,.. From 
(2) we have E, by the transformation M,. Hence the column vectors 
in S, can be written in terms of the orthogonal frame of FE, by the 


transformation 
MS,=L,, (8) 


where columns of L, show the same vectors as the columns of S,. The 
column vectors of L, are referred to the orthogonal frame of E,. 
Hence the lengths of the column vectors of S, can be found from the 
corresponding columns of L,. The sum of the squares of the elements 
in each column of L, are recorded and also the reciprocal of the square 
root of that sum. The latter is the multiplier by which the correspond- 
ing vector in S, can be reduced to a unit vector. Let the multipliers 
be denoted d, and let these be the elements of a diagonal matrix D, . 
Then 


L,D, =M 1» (9) 
in which the columns of M, in Table 6 show the direction cosines of 
the new unit reference vectors A, in terms of the orthogonal refer- 


ence frame. 
Applying the same stretching factors in D, to the column vectors 


in S,, we have 
S,D, =T,, (10) 








206 PSYCHOMETRIKA 


which is the transformation that carries the extended factorial matrix 
E, into the next factorial matrix E,. Then 


E.T, = E,, (11) 


which is the starting point for the next cycle of rotations. The fac- 
torial matrix EF, can also be obtained directly from the extended cen- 
-troid matrix E,, so that 


E.M,= E,. (12) 


The computational work can be summarized as follows: 

1. When the entries in S, have been obtained from the graph, 
perform the matrix multiplication (8). 

2. Sum the squares of the entries in each column of L, and de- 
termine the reciprocal of the square root of each of these sums. The 
modulus d, is recorded under each column. 

3. Multiply each column of S, by the corresponding modulus d, 
to obtain T,, as shown in Table 5. This step is shown formally in 
(10). Also determine M, from (9). 

4. Multiply the matrix E, by T, to obtain E, in Table 6. This 
step is shown in (11). 

Table 6 shows the computations for the second cycle of rota- 
tions. The matrices E, and M, have already been obtained, as shown 
in (9) and in (11). First determine the scalar products in M’,M,. 
The side entries in this matrix show the cosines of the angular sepa- 
rations of the new reference vectors. If two of the new vectors are 
nearly collinear, a new plane should be found in the first set of graphs. 
One may carry more planes than there are dimensions, but in such a 
case they will converge so that before the final rotations are made 
there will be usually fewer meaningful planes than there are dimen- 
sions. 

The pairs of columns of EF, are plotted. Five of the diagrams are 
shown in Figure 4. Now the bounding planes are more clearly defined, ° 
and the rotations are all smaller in angular magnitude. The identity 
of each plane is easily retained. From the graphs we determine the 
entries in the matrix S.. The matrix multiplication, 


M.S, = I. , (13) 


corresponds to (8) for the first cycle of rotations. The reciprocals of 
the lengths of the column vectors in L, are recorded as d,. These 
multipliers on L. give 


LD, — M, ’ (14) 
which is recorded on the next data sheet. The multiplication, 
S.D. = Se ’ (15) 
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corresponds to (10) for the first cycle. Finally, 
E,T,=E., (16) 


which is recorded on the data sheet for the next cycle. 

For the third cycle we condense our description to the matrix S; 
in Table 7 and the graphs in Figure 5. These graphs show still smaller 
remaining movements. The fourth cycle has the matrix S, in Table 
8, which is determined from the graphs in Figure 6. These graphs 
show the planes to be so nearly in place that it is not worth the trouble 
to carry further approximations. They would affect only the second 
decimal place in the factorial matrix. 

The last step is to reduce each of the extended vectors to its origi- 
nal length. This is done by dividing each row of E, by the stretching 
factor D. which produced FE. from F,. The final matrix F’, can be ob- 
tained from the given matrix F,, by the transformation 


F,A,=F,, (17) 


where A, is anr X r matrix. If the first column of M, is deleted, the 
remaining matrix is A,. It is shown in Table 9. The columns of 
A, show the direction cosines of the unit reference vectors which de- 
fine the simple configuration. They are expressed in terms of the 
orthogonal reference frame of the initial centroid factorial matrix 
F,. The scalar products of columns of A, are shown in the matrix 
A’, Ay. The side entries in this symmetric matrix show the correla- 
tions between the primary factors in the fictitious test battery. These 
are practically zero. The final matrix F', of Table 9 is shown in the 
graphs of Figures 7 and 8. Table 9 is compared with Table 1, and it 
is seen that the approximation to the original simple structure is 
very close. Further rotations would make the two matrices identical 
within any required degree of accuracy, and the correlations between 
the primaries would, of course, also become exactly zero. The present 
example has been carried farther than would be necessary in any ex- 
perimental problem. 

The plots are made with orthogonal axes on the graphs even 
though the reference vectors are oblique. Angles between reference 
vectors that are less than 45 degrees are plotted to scale on the graphs. 
But it is not often necessary to represent on the graphs the true an- 
gle between the reference vectors. On each graph in this numerical 
example the cosine of the angle between the two reference vectors is 
recorded, but all the graphs are drawn with orthogonal axes as shown 
in the figures. 

In this method each of the coordinate planes is given the oppor- 
tunity to find its own location independently of the other planes. In 
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general, this gives an oblique set of reference vectors unless the un- 
derlying order is orthogonal. When the configuration turns out to be 
orthogonal without imposing orthogonality, it can be taken as good 
evidence of an underlying physical order. But we must consider the 
possibility of oblique configurations that are themselves of consider- 
able interest. Under certain conditions we should expect differential 
maturation of the factors to be responsible for an oblique configura- 
tion. At any rate, the method here described does not impose or- 
thogonality on the system. But orthogonality can be maintained in 
the successive rotations by watching the cosines in the product ma- 
trices M’'M if orthogonality is to be imposed. 


TABLE 1. A fictitious simple configuration 


; 8 2 & «= 
1 0 3 0 0 8 73 
2 0 9 0 0 0 81 
3 5 5 0 5 0 75 
4 5 af 0 0 0 74 
5 0 3 3 0 7 67 
6 0 A 6 A 0 68 
7 0 0 8 0 0 64 
8 0 5 sf 0 0 74 
9 0 0 6 6 0 72 
10 0 0 A 6 5 17 
11 6 0 5 A 0 17 
12 3h 3 3 0 0 67 
13 0 0 0 0 xy 81 
14 0 8 0 A 0 80 
15 6 0 5 0 3 -70 
16 6 0 6 0 0 12 
17 0 0 si 0 5 74 
18 4 0 0 0 7 65 
19 8 0 0 0 0 64 
20 0 0 0 6 6 72 
21 0 0 0 8 0 64 
22 3 0 0 4 7 74 
23 0 5 0 5 5 75 
24 ot 3 0 0 3 67 
25 3 0 0 8 0 73 


TABLE 2. An arbitrary orthogonal transformation A, 


I II III IV V 
Ps 447 —.148 —.092 —.832 —.282 
- 409 548 595 —.084 420 


i 447 —481 —.875 128 678 
at 447 554 —.554 307 —.307 
P 484 —.438 438 438 —.438 











TABLE 3. A centroid matrix F’, of the fictitious test battery 


CONIA Th WON eH 


CONIA tkwnd eH 
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I II ITI IV V D, 
510 —.188 529 325 —.224 1.9608 
368 489 536 —.076 378 2.7174 
652 474 —.026 —.304 —.084 1.5337 
510 306 870 —.475 153 1.9608 
596 —.273 373 320 .023 1.6779 
611 180 —.209 .166 452 1.6367 
358 —.345 —.300 102 542 2.7933 
517 —.030 035 .048 685 1.9342 
536 074 —.557 .261 .223 1.8657 
689 —.059 —.263 454 —.132 1.4514 
670 —.083 —.464 —.312 047 1.4925 
570 —.070 002 —.569 182 1.7544 
436 —.394 394 394 —.394 2.2936 
506 .656 254 056 .213 1.9763 
637 —.486 —.111 —.304 .038 1.5699 
5386 —.347 —.280 —.422 .238 1.8657 
555 —.521 —.044 309 256 1.8018 
518 —.366 270 —.026 —.419 1.9305 
3858 —.118 —.074 .—.666 —.226 2.7933 
559 070 —.070 447 —.447 1.7889 
358 443 —.448 246 —.246 2.7933 
652 —.129 057 180 —.514 1.5337 
.670 .330 .240 230 —.162 1.4925 
581 —.072 246 —A76 —.203 1.7212 
492 399 —.471 —.004 —.330 2.0325 
TABLE 4. Extended factorial matrix EF, 

I IT IIT IV V 
1.000 —.369 1.087 687 —.438 
1.000 1.329 1.457 —.207 1.027 
1.000 727 —.040 —.466 —.129 
1.000 .600 725 —.931 .300 
1.000 —.458 .626 537 039 
1.000 295 —.342 272 -740 
1.000 —.964 —.838 285 1.514 
1.000 —.058 .068 093 1.325 
1.000 138 —1.039 487 416 
1.000 —.086 —.382 659 —.192 
1.000 —.124 —693 —.466 .070 
1.000 —.123 004 —.998 .232 
1.000 —.904 904 904 —.904 
1.000 1.296 502 add A21 
1.000 —.684 —174 —.477 .060 
1.000 —.647 —.522 —.787 444 
1.000 —.9389 —.079 557 461 
1.000 —.707 521 —.050 —.809 
1.000 —.330 —.207 —1.860 —.631 
1.000 125 —.125 800 —.800 
1.000 1.237 —1.237 687 —.687 
1.000 —.198 .087 276 —.788 
1.000 493 358 493 —.242 
1.000 —.124 423 —.819 —.349 
1.000 811 —.957 —.008 —.671 
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TABLE 9. The simple configuration F', after four rotations 
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TABLE 7. The matrix S, 

I A’ B Cc’ D’ 

1.00 —.07 —.05 .07 04 
1.00 .06 
12 1.00 

1.00 —.05 

1.00 

—.14 


TABLE 8. The matrix S, 
I A’ B' C’ D’ 
1.00 -05 03 —.05 —.02 
1.00 .03 
1.00 07 
1.00 
1.00 
—.07 —.03 


A B Cc D 


-197 004 314 —.004 
—.009 —.011 900 —.004 
007 496 488 -005 
—.007 —.007 -687 001 
696 006 312 297 
003 400 399 601 
—.003 006 —.002 801 
—.008 —.001 498 699 
011 -605 .000 604 
513 -608 .009 401 
008 405 —.016 -506 
—.004 000 282 304 
900 009 017 —.003 
001 391 800 —.001 
298 008 —.010 508 
—.003 .006 —.016 -604 
A97 011 .007 -700 
-701 008 003 .000 
.000 002 —.020 004 
614 607 012 001 
020 801 001 004 
-710 408 006 .001 
507 499 510 —.002 
298 .001 288 .002 
019 802 —.007 006 





E’ 
—.10 


16 


1.00 


.08 


—.08 


1.00 


E 


.012 
027 
515 
522 
.008 
006 
—.007 
007 
—.006 
—.002 
595 
-706 
.003 
024 
597 
593 
—.006 
402 
801 
.002 
-000 
302 
017 
-710 
300 








I 
II 
III 
IV 
V 


FoVAw> 


A 


488 
—.428 
420 
445 
—.452 


A 
1.000 
034 
008 
—.008 
003 
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TABLE 9 (continued) 


B 
451 
539 

—.559 
311 
—.311 


1.000 
—.011 
.013 
002 


A, 

Cc 
407 
540 
605 

—.056 
AIT 


D 
448 
—.429 
—.382 
123 
674 


1.000 
—.004 


E 
456 
—.128 
—.068 
—.833 
—.277 


1.000 


213. 



































FIGURE 3 
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CENTROID ORTHOGONAL TRANSFORMATIONS 


H. D. LANDAHL 
The University of Chicago 


An initial transformation for facilitating the analysis of a fac- 
tor problem is discussed. Such a transformation is used in the ro- 
—— procedure developed by L. L. Thurstone in an accompanying 
article. 


The process of rotating from the centroids of a factored battery 
of tests to its solution configuration is often facilitated by the intro- 
duction of an arbitrary orthogonal rotation applied directly to the 
factorial matrix, provided the rotation has the property that the sum 
of its reference vectors is a vector which is collinear with the first 
centroid of the factorial matrix. There are an infinite number of ma- 
trices which satisfy this condition. It is the purpose here to justify 
the use of such a transformation and to give an appropriate trans- 
formation for dimensions. 

In test batteries where the intercorrelations are predominantly 
positive, the few negatives having small absolute values, most, if not 
all of the important factors have positive projections on the first cen- 
troid. Since these projections are, generally, unknown, and since 
there is no reason for introducing an oblique reference frame, it is 
not illogical to take an orthogonal rotation with equal projections on 
the first centroid. It can be shown that the sum of the reference vec- 
tors of such a rotation is a vector collinear with the first centroid. 

If the centroid axes are taken in pairs and the projections plotted, 
the graphs show an almost random scattering of points. Considering 
that these axes are centroids, such a random scattering is not unex- 
pected; in fact, the factorial matrix is the most unfavorable repre- 
sentation of the data. 

Let us present a procedure for determining each element of an 
nX n matrix and show that this matrix* is orthogonal, and that the 
sum of the columns as vectors is a vector collinear with the first cen- 
troid. 


1. Each element of the first row has the value 1/\/”, where n 
is the order of the n < n rotation matrix, that is, the number of di- 
mensions. 


* The matrix thus set up is neither unique nor determined by an unique pro- 
cedure, but is given in this form for convenience only. 
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2. The remaining elements of the last column, beginning at the 
last element and going up, are the first (7 — 1) terms of the series 
—.707, —.408, —.289, —.224, —.183, —.154, —.134, —.118, 

2 —1/VERE+SIE,.... 

3. Consider the (n —1) & (n —1) matrix resulting from the 
omission of the first row and the last column. The diagonal elements 
of this matrix, from lower right to upper left, are the first (n — 1) 
terms of the series 

.707, .816, .866, .894, .913, .926, .935, .9438, 
Jo VE/RLT), .... 

4. Each element below the diagonal of this (n — 1) (n — 1) 
matrix is zero. 

5. In each row, the remaining elements are each equal to the 
value of the last element of that row of the n < n matrix. Note that 
these elements are all negative. 

A necessary and sufficient condition that a matrix transforma- 
tion be orthogonal*} is that 

> ip = 1, 
pHl1 


and 
4 ues se aR, oes 
ZAipflin = 0, tA), fG=1,2---n. 

For the 7" row, the value of & in either of the above series is 
given by (n —i-+1),i> 1. The first element in the 7‘ row, other 
than zero, is an element of the (n — 1) & (n — 1) matrix and its 
value is \/ (n — i+ 1)/(m —i-t 2). Each of the remaining (n — 
i+ 1) elements of the 7" row has the value 


—1/V(n—i+2)™m—it}). 


e 9 _ +1 , 1 — 
=a 7 Menthe i nti Bed (n—1+2) (n—7+1) 








Then 





The sum of the squares of the elements of the first row is also unity 
since each element has the value 1/\/n. The first part of the condi- © 
tion is then satisfied. 

If we take j > i, the only elements of the i" row not multiplied 
by zero in carrying out the second part of the condition are the last 
(n — j + 2) elements. These elements are identical and have the 
value —1/\/ (n—i-+ 2) (n—i-+1) fori >1,and 1/7 fori =1. 


* Muir, Thomas and Metzler, William H., A Treatise on the Theory of De- 
terminants, Menasha, Wisconsin: George Banta Publishing Company, 1929, p. 


566. 





+ The condition may be stated with all subscript pairs interchanged. 
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In either case there is a common factor which we may designate as 
f;. We then have, fori+j7, 





: —_¢/ /w—j+1 : , ; - 
Disp - f ere. ame (n—Jj+1) V1/ (n—j-++2) OTD | oe 


The second part of the condition for orthogonality is then satisfied. 
Moreover, since the factor which vanishes is the sum of the elements 
of the 7" row, 7 > 1, the sum of the columns as vectors is a vector col- 
linear with the first centroid. 

A table of matrices, set up in accordance with the above proce- 
dure, is given for computational convenience. Alternative matrices 
are included for the cases n = 4 and n = 8. In each of these two 
cases, the absolute value of each element is 1/\/n. By inspection it 
can be seen that they satisfy the imposed conditions. One might draw 
attention to the fact that the mean of the entries in the first row of 
the final rotation matrix in the study “Primary Mental Abilities’’* 
is .285, and the standard error is .053. The value of 1/\/n, where 
n = 12, is .289. 

It may be that one} of the final reference vectors can be approxi- 
mately specified from a knowledge of the factorial composition of one 
or more of the tests. A previous study or other considerations may 
be the source of this knowledge. The following suggestion permits a 
most advantageous use of this information. 

Let the column matrix |a,,| represent the known vector and let 
M be the given centroid transformation matrix. Take the product 


given by the expression })|a@,,a,;| for each j, and let q be the value of 
p=2 


j for which the expression has the greatest value.t Now change the 
signs§ of the rows of M so that the column q has the same arrange- 
ment of signs as the column of |@,|._ Then 


|@pn|’|@pq| = Z| aantty 


The vector represented by column q is sufficiently close to that of 
|@y,| so that the new matrix may be directly applied, since usually the 


*Thurstone, L. L., Primary Mental Abilities. 1988, Psychometric Mono- 
graphs No. 1. 

+ The following may be generalized, approximately, for more than one vector. 

t Most of the columns can be eliminated by inspection. 

§ One might interchange the rows and obtain a still larger value for the ex- 
pression given above, especially for the case of more than one known vector. 
Note that the orthogonality of a matrix is not affected by interchanging rows or 
~~ eens signs of rows; this fact is evident from the condition for orthogo- 
nality. 
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error in the approximately known vector is fairly large. If it is de- 
sired, however, the values of the matrix |a,,| may replace the column 
q. In this case the resulting matrix transformation is not exactly a 
centroid rotation nor exactly orthogonal. Even in this latter case few 
if any of the first set of graphs would require oblique plotting. Be- 
cause the configuration is either orthogonal, or approximately so, 
there is an excellent chance that the graphs will indicate definite ro- 
tations of some of the vectors with the known vector. It is even more 
probable that the approximately known vector will show a rotation 
leading toward its final stable position. 


TABLE OF MATRICES 


577 = 577 577 500 .500 .500 .500 500 .500 .500 .500 
816 —.408 —.408 .866 —.289 —.289 —.289 500 .500 —.500 —.500 
000 = .707 —.707 000 .816 —.408 —.408 500 —.500  .500 —.500 

000 .000 .707 —.707 .500 —.500 —.500 .500 
ATi AT ATT ATI ATT 408 408 .408 .408 .408 .408 
894 —.224 —.224 —.224 —.224 .913 —.183 —.183 —.183 —.183 —.183 
000 .866 —.289 —.289 —.289 000 .894 —.224 —.224 —.224 —.224 
000 .000 .816 —.408 —.408 000 .000 .866 —.289 —.289 —.289 
000 .000 .000 .707 —.707 000 §=.000 6.000 §©.816 —.408 —.408 


000 §©.000 .000 000 .707 —.707 


378 «378 6.3878 «6.3878 §=6.878 )~=—_ «378 ~—( 878 
926 —.154 —.154 —.154 —.154 —.154 —.154 
000 § .913 —.183 —.183 —.183 —.183 —.183 
000 .000 .894 —.224 —.224 —.224 —.224 
000 .000 .000 .866 —.289 —.289 —.289 
000 .000 .000 .000 .816 —.408 —.408 
000 .000 .000 .000 .000 .707 —.707 


354 854 .354 .854 .854 .854 .854 .354 
935 —.134 —.134 —.184 —.134 —.134 —.134 —.134 
000 .926 —.154 —.154 —.154 —.154 —.154 —.154 
000 .000 .913 —.183 —.183 —.183 —.183 —.183 
000 .000 .000 .894 —.224 —.224 —.224 —.224 
.000 .000 .000 .000 .866 —.289 —.289 —.289 
000 .000 .000 .000 .000 .816 —.408 —.408 
000 000 000 000 .000 000 .707 —.707 


354 354 854 854 854 .854 .854 .854 
354 —.354 .354 .354 —.354 —.354 —.354 .354 
354 354 .354 —.354 —.354 .854 —354 —.354 
354 .854 —.354 —.354 .354 —.354 —.354 .354 
354 —.354 —354 354 .854 .3854 —.3854 —.354 
354 —.354 .354 —.354 .854 —.354 .3854 —.354 
.354 —.354 —.354 —.354 —.354 .3854 .854 .354 
354 354 —.354 3854 —354 —354 3854 —.354 








——_ a soe ’ 
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383 .8838 §©.883 «= «8388.—s(«833) C883) 883883888 
943 —.118 —.118 —.118 —.118 —.118 —.118 —.118 —.118 
000 .985 —.184 —.184 —.134 —.134 —.134 —.134 —.134 
000 .000 .926 —.154 —.154 —.154 —.154 —.154 —.154 
000 .000 .000 .918 —.183 —.183 —.183 —.183 —.183 
.000 .000 .000 .000 .894 —.224 —.224 —.224 —.224 
.000 .000 .000 .000 .000 .866 —.289 —.289 —.289 
000 .000 .000 .000 .000 .000 .816 —.408 —.408 
.000 .000 .000 .000 .000 .000 .000 .707 —.707 
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A THEORY OF LEARNING AND TRANSFER: II 


HAROLD GULLIKSEN AND DAEL L. WOLFLE 
The University of Chicago 


The theory of discrimination learning developed in Part I* of 
this paper is extended to more complex learning situations such as 
those used for differentiating relative from absolute habits. The 
greater difficulty of learning an absolute discrimination than of 
learning a relative discrimination is predicted. The theory is also 
extended to cover cases of transfer. It predicts that under the or- 
dinary training conditions transfer to new stimulus situations will 
be on a relative basis, and it predicts the type of transfer to be ex- 
pected under other training conditions. A number of additional es- 
tablished facts of learning and transfer can be deduced from the 
theory. Methods of testing the adequacy of the theory by new and 
more crucial experiments are suggested. 


FOUR-CONFIGURATION PROBLEM 


The line of reasoning employed in the analysis of situations in- 
volving two configurations can be extended to cases in which four con- 
figurations are used in the training. Four configurations, for example, 
must be used in establishing an absolute brightness discrimination 
and may be used in several other types of experiments. We will give 
several illustrations of the diverse experiments which are suggested 
by the present approach. Most of them, to our knowledge, have never 
been performed. In order to make the problems concrete, the illus- 
trations will be made in terms of brightness discrimination. Consider 
the four configurations shown in figure 6, and diagramed as points 
1, 2, 3, and 4 in figure 7. 

Six different experiments are possible with different pairings of 
these four configurations. Two involve relative discrimination; two, 
absolute discrimination; and two, discrimination of total brightness. 


Relative discrimination: 

A. If a jump to the left is demanded in response to configurations 
1 and 3, and a jump to the right in response to configurations 2 and 
4, the experiment would usually be described as training the animal 
to respond to the relationship “brighter than.” 

B. If the responses demanded in A are reversed for all four con- 


*Gulliksen, Harold and Wolfle, Dael L. “A Theory of Learning and Trans- 
fer: I,” Psychometrika, 8 1988, 127-149. 
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I. 3. 
8 2 32 8 
2. 4. 
2 8 8 32 


FIGURE 6 
Four configurations to be used in train- 
ing for relative discrimination, absolute 
discrimination, or discrimination of total 
brightness. 
The numbers beneath the circles repre- 
sent the relative brightness of the stimu- 
lus cards. 


I. 3. 
2 2 32 32 
f. @ e- 
“4 32 32 2 


FIGURE 8 
Four configurations to be used in train- 
ing for the six problems described on 
page 227. 





Absolute discrimination: 


lute brightness of 8 units. 


ness of 8 units. 
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figurations, it would be said that the animal was being trained to re- 
spond to the relationship “dimmer than.” 


C. If a jump to the left is demanded in response to configura- 
tions 1, and 4, and a jump to the right in response to configurations 
2 and 3, the animal is being trained to respond positively to an abso- 


D. If the responses demanded in C are reversed for all four con- 
figurations, the animal is being trained to avoid an absolute bright- 
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FIGURE 7 
Plot of four configurations. 
By different pairings of these 
four configurations, training 
may be given on any of the 
six problems described on 
pages 225-227. 
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FIGURE 9 
By different pairings of these 
four configurations, training 
may be given on any of the 


six problems described on ' 


page 227. 
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Discrimination of total brightness: : 

E. If a jump to the left is demanded in response to configura- 
tions 1 and 2, and a jump to the right in response to configurations 
3 and 4, the experiment may be described by saying that when the 
total brightness is low (10 units) the animal is required to jump to 
the left. When the total brightness is high (40 units) it is required 
to jump to the right. This experiment might also be described by say- 
ing that the animal is trained to go to the left whenever one of the 
lights is very dim and to the right whenever one of the lights is very 
bright. 

F. As formerly, the responses demanded in FE may be reversed 
for all four configurations. 

We do not know that the last three experiments have ever been 
performed. 

It is possible to make a similar analysis involving any set of four 
configurations. If the four configurations illustrated in figure 8 and 
diagramed in figure 9 are used, there are, again, six different experi- 
ments which may be performed. If, for example, a jump to the left 
is demanded in response to configurations 1 and 3, and to the right 
in response to configurations 2 and 4, we are training the subject to 
go to the left when the two stimuli are the same and to the right when 
they are different. As before, this training may be reversed. The 
other four experiments with this set of configurations are all essen- 
tially alike and we will give only one illustration. If we train the sub- 
ject to jump to the right in response to configurations 1 and 2, and 
to the left in response to configurations 3 and 4, we may describe this 
experiment by saying that we require the subject to attend to the left 
Atimulus of each configuration. If that stimulus is dim he goes to 
‘ the right; if it is bright he goes to the left. The three other experi- 
ments are analogous to this one. 

With each possible set of four configurations six different experi- 
ments could, in general, be performed. Of all experiments possible 
with four configurations, only two types have been reported in the 
literature — training the animal to discriminate relative brightness 
and training it to discriminate absolute brightness. The theoretical 
formulation to be given should apply to any possible experiment with 
four configurations and hence may be tested not only in the conven- 
tional relative and absolute discrimination problems but in a large 
number of new experimental situations as well. 

Definitions 

The theoretical formulation may now be extended to include the 

four-configuration problem. 
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Using odd-numbered subscripts, 1 and 3, to designate contenu 
» tions for which one response is required and even-numbered sub- 
scripts to indicate configurations for which the other response is re- 
quired, by analogy with equations (II-7) and (II-8) we wid write 
for configuration 1: y yr" > Nea 


| du, _ a CU, — O(4Us + IesaW, + KyWs al ‘ 


8, dw, hy kw, + kygws — Crate — Cay, 





The different terms have the same meanings as in equation (II-7). 
k,; and e,; (7 = 2, 3, or 4) represent the influence of reward or pun- 
ishment of a response to configuration 7 upon the strength of the ten- 
dency to give the same response to configuration 1. 

The differential equations for configurations 2, 3, and 4 may be 
obtained from (IV-1) by the proper interchange of subscripts. Inter- 
changing subscripts 1 with 2 and 3 with 4 gives the differential equa- 
tion for configuration 2. Interchanging subscripts 1 with 3 and 2 
with 4 gives the differential equation for configuration 3. Interchang- 
ing subscripts 1 with 4 and 2 with 3 gives the differential equation 
for configuration 4. 

We will solve these equations for the special case in which the 
four configurations are so chosen that, when plotted as in figure 7, 
they lie on the corners of a rectangle. In this case: 

The distance 1-2 equals the distance 3-4. 

The distance 1-3 equals the distance 2-4. 

The distance 1-4 equals the distance 2-3. 

From these equalities the learning parameters may be redefined as 
follows: 


Cre = Cy = C1, (IV-2) 
ke = ky =k, , (IV-3) 
Cis = Cog = C2, (IV-4) 
Kizs=—Kn =k, ~ (IV-5) 
Cis = Cog = C3, (IV-6) 
Kes, = Kees = eg . (IV-7) 


The subscript 1 now indicates the distance from any configuration 
to the closer, and the subscript 3 indicates the distance to the farther, 
of the two configurations to which the opposite response is required. 
The subscript 2 indicates the distance to the other configuration to 


* The number of each equation dealing with the four-configuration case will 
be preceded by IV. 
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which the same response is required. 
It is assumed that 9:—=92=93 = 9%; 
and that h,=h=h,=h. 


Substituting equations (IV-2) to (IV-7) inclusive in (IV-1) we 
have — with changes of subscripts indicated above — four equations. 
Making the assumption that (see equation II-10) 


W, = W2 = W3 = Wi, (IV-8) 


we may by a process similar to that outlined in conjunction with 
equation (II-13) show that 


Uy = Us = Us = Uy (IV-9) 


under the same restrictions as in the previous case. 


Derivation 


The identities of equations (IV-8) and (IV-9) make it necessary 
to solve only one equation, namely 


du _g9— (€+¢2)u+ (k, + ks) w 





dw ht(k+kh)w— + e,)u. (IV-10) 
By setting 
(c+) =c, (IV-11) 
(k++ hk) =k, (IV-12) 
(c,-+ ¢,) =c’, and (IV-13) 
(kA, + k3) =k’, (IV-14) | 


it can be seen that equation (IV-10) is formally equivalent to equa- | 
tion (II-15) in the two-configuration case. 

Substituting equations (IV-11) to (IV-14) inclusive in equations 
(11-33), (II-38), (II-39) or (II-40) gives the corresponding solutions 
for equation (IV-10). Similarly the equations for the asymptotes, 
their intersection, the tangent or secant of the angle between them, 
and the semi-major axis of the equation for the four-configuration 
problem may be written, by analogy, from the corresponding equa- 
tions of the two-configuration problem. 

To make a direct and simple comparison of the two- and four- 
configuration cases we will use only equation (II-40) of the former 
derivation in which g = h and c = k. This equation is 


2kuw 29qu 2gw 
ut + wt — — 4 = ug: (11-40) 
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Making the substitutions indicated in equations (IV-11) to (IV-14) 
and setting c; = k; gives the corresponding four-configuration form 
as follows: 







+ ke 29 “a 
oo. La, 
(IV-16) 






u? +- w? — 2 






Properties of the Learning Curve for the Four-Configuration Problem 





The slope of the asymptote of the learning curve for the four- 
configuration problem is: 


I+ Ken — Ve te)? — (a la)? 































k, +k; 
If a new variable, x, be defined as 
_k+h x. 
the slope of the asymptote may be rewritten as 
2— Vx?—1. (IV-18) 


The derivative of this expression is negative for values of x from 
1 to o, which shows that as x increases above values of 1, the slope 
of the asymptote decreases. If x < 1, the radical becomes imaginary, 
and if « = 1, the slope of the asymptote equals unity. Consequently, 
if learning is to occur, the condition 


k+k, 


kek, >1 (IV-19) 


must be satisfied. 


Application to Experimental Data 


For any given set of four configurations, k will always retain the 
same value for a given subject regardless of whether he is being 
trained on a relative, absolute, or total size discrimination. The value 
of « [Equation (IV-17)] will, then, depend upon the relative sizes of 
k, , k2 , and k,, which will vary with the particular problem. 

In figure 10, the four configurations are plotted. The symbols 7, 
m and n represent the amount of transfer from configurations 2, 3, 
and 4 to configuration 1. Since transfer is an inverse function of dis- 
tance it is obvious that: 
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k>m>n,and 
k>p>n. ) 


In the case of relative discrimination one response is given to 
configurations 1 and 3 and the other response to configurations 2 and 
4. In this case k, = p, k, = m, and k; = n. The value of x [Equa- 
tion (IV-17)] will then be 


k+m 
x ee (IV-20) 
In the case of absolute discrimination one response is given to 
configurations 1 and 4 and the other response is given to configura- 
tions 2 and 3. In this case k, = p, k, = n, k, = m. The value of z 
will then be 
k+n 
x cis (IV-21) 
(1) It may be deduced that the wltimate level of accuracy (as rep- 
resented by the slope of the asymptote) attainable in an absolute dis- 
crimination will be lower than that attainable in a relative discrimina- 
tion involving the same four configurations. 
——» The level of accuracy attainable will be less when the asymptote 
of the learning curve has a greater slope. From equation (IV-18), it 
is clear that the slope of the asymptote increases with a decrease in 
the value of x. Since m > n, x in equation (IV-20) is greater than x 
in equation (IV-21), i.e., 


lt hee ktn 
p+n~ p+m- 


If the slope of the asymptote is taken as a measure of the difficul- 
ty of the problem, the conclusion is reached that an absolute bright- 
ness discrimination is more difficult than the corresponding relative 
discrimination. 

No work has been reported on the ultimate level of accuracy at- 
tainable under the two types of conditions. But if the two conditions 
differ in this respect, it is to be expected that more trials will be re- 
quired to reach a given level of accuracy with an absolute than with a 
relative discrimination. This expectation is borne out by the experi- 
ments of Warden and Rowley (30)* using the ring dove. Warden and 
Winslow (31) using the ring dove, and Wolfle (33) using the white 


rat. 





* The references given in this paper will be found in Psychometrika, Vol. 3, 
pages 148-149. 
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STIMULUS ON LEFT 





1 2 4 8 6 32 64 128 
STIMULUS ON RIGHT 


FIGURE 10 


Plot of four configurations. The values of k, p, m, and n 
represent the amounts of transfer from configurations 1, 
2, 3, and 4 to configuration 1. 


TRANSFER-T 














NUMBER OF CORRECT RESPONSES-W 


FIGURE 11 


The relationship between consistency of transfer and 
previous number of correct responses to the training con- 
figuration for three different test configurations. 


hw 
(2) -If an animal is trained to go to the left (or right) in response 
to configurations 1 and 3 of figure 7 and to the right (or left) in re- 
sponse to configurations 2 and 4, and if the distances from 1 to 2 and 
from 3 to 4 are held constant, it may be deduced that the ultimate 
level of accuracy, as measured by the slope of the asymptote, is the 
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same when configurations 1 and 3 (and 2 and 4) are far enough apart 
to be essentially independent as it is when they are so close together 
that they are practically identical. 

If configurations 3 and 4, in figure 7, are very far from configura- 
tions 1 and 2, k, and k; (in the case of relative discrimination) are 
essentially zero. In this case 


If, on the other hand, configurations 3 and 4 are gradually brought 
closer to 1 and 2 until 1 and 3 are identical and 2 and 4 are identical, 
k, approaches k and k, approaches k,. In this case the four configura- 
tions have reduced to two. In this case also 


The values of x and the slopes of the asymptote must be the same for 
the two extreme cases. The optimum of ease or difficulty between 
these limits will depend upon the function relating k to the psycho- 
logical distance between the two configurations involved. “Ease” or 
“difficulty” refers here to the ultimate level of accuracy attained re- 
gardless of the number of trials or errors required to reach this level. 


TRANSFER OF RESPONSE TO NEW CONFIGURATIONS 
The Two-Configuration Case 
The theory may be extended to problems of transfer. If an ani- 
mal has been trained to jump to the right when presented with con- 
figuration a and to the left when confronted with configuration b, we 
may designate a jump to the right as an a-response and a jump to the 
left as a b-response. What response will the animal now give if it is 
confronted with any new configuration, 7? 
Let a; = the initial strength, prior to any training on a and B, of the 
tendency to give the a-response to configuration 7, 
and b; = the initial strength, prior to any training on a and b, of the 
tendency to give the b-response to configuration 1. 


If t, denotes the cumulative count of a-responses given to i, and # de-. 


notes the cumulative count of b-responses given to i, then, 

following the formulation of the two-configuration case, we may write 
dt, __ & + hiaWa — Cintly 

dt, ce b; oe Ciqla + kiywo : 





(T-1)* 


* The number of each equation dealing with the problem of Transfer will be 
preceded by T. 
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Here the w’s and w’s represent the cumulative count of incorrect and 

correct responses given to configurations a and b. The coefficients of 

these variables are functions of the animal’s learning ability and the 

distance from 7 to a and to b. The initial differential equation differs 
only in its subscripts from that for the two-configuration problem. 

dt 

If ii, = 1, then the tendency to give the a-response will be equal 

to the tendencey to give the b-response, and there will be no consistent 

tendency for either the a-response or the b-response to predominate. 

dt, dt, 
<1, 


If = > 1, then the a-response is more likely to be given. If ai 
b b 


: ‘ ; ; us 
the b-response is more likely to be given. Since bod represents the 
b 





amount and direction of transfer to be expected, we may substitute 
dt, 

T for dh, . 

If the initial strength a; and b; as well as the coefficients of the 
u’s and w’s were known from previous experiments it would be easy 
to substitute the cumulative correct and incorrect responses to both 
aand b at any stage of learning and predict the relative strengths of 
the a-response and b-response tendency for any configuration 7. 

As with the learning problem, we may simplify the treatment 
by considering’ only the case where 


a,=0;, 
Ua = UW, and 
We = Wo. 


If, in addition, we assume that c = k, the transfer effects will 
show up more clearly. With these assumptions (T-1) becomes 


a; a kigw — kinu 


; ” ie a; — kiqu + kipw ; 


(T-2) 





From equation (II-41) we can see that u = p(w) so that substi- 
tuting (II-41) in (T-2) will give 


k sie Nts 
(a; — ki) + (kia — kin) w + ki VF (w) 





T= ’ (T-3) 


k aw 
(a; — hi) + (kin — Kia) w + Kia VF (w) 


ret [eae CEB) 


where 
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Properties of This Equation 


(1) It can be seen from equation (T-2) that if ki, = ki», then T is 
unity for all values of w (See figure 11, curve 1.). 

(2) As ki, decreases and kj, increases, T becomes smaller. When 
i= b,ki» = k, which is its maximum value, and ki, = k’. For this 
point, the transfer values may be designated by T,. Substituting 
these values in equation (T-3) and setting a; = g, gives 


k F’ (w) 
T, = —— Sia T-4 
If w = 0, T is unity and T, is unity, and as 


ke 
Ww Poa ei: 
The first derivative of (T-4) is negative, so the curve relating trans- 
fer to previous numbers of correct responses is as shown in figure 11, 
curve 2. 
(3) When i = a,we will have large T-values, designated by T,, for 
any given w. This transfer curve is found by setting ki, = k’ and 
ki, = k. It can be seen from equation (T-2) that an interchange of 
k;, and kj, gives a new value of T which is the reciprocal of the former 
one, so that the curve for T, is equal to 1/7. This curve is shown as 
curve 3 of figure 11. 

The derivative of (T-3) may be written 


ti oy 79) | cae p(M+ 2p 4 M VF) | 


dw ki k ; f 
VF(w) l(ae— r+ je re)” + Fw) | (T-5) 


where F'(w) has the same meaning as in equation (T-4) and 


-$(0+2) ale +1). 


It can be seen from equation (T-5) that cl will become zero under 





three conditions. 
(1) Dividing both numerator and denominator by w, it can be seen 


that as w > o, a — 0, Thus, as the amount of practice becomes 


very large, T approaches a stable value. 
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_— c 
(2) If kia = Kip — will equal zero for all values of w, as shown in 


curve 1 of figure 11. 

(3) Any value of w for which the bracketed expression in the nu- 
. aT 

merator of (T-5) vanishes will make , equal to zero. If the second 

derivative is negative or positive, such values of w will represent the 

amount of training for which transfer is, respectively, either a maxi- 

mum or a minimum. 


Applications to Experimental Data 


(1) . It may be deduced that: If the animal has been trained to go to 
the larger (or smaller) of two stimuli regardless of whether this 
stimulus is on the right or left side, it will respond on a relational 
basis in transfer tests. 

Assume that the animal was trained to go to the left for con- 
figuration A and to the right for configuration B, that is, to the larger 
of the two stimuli, as diagramed in figure 12. The tendency to jump 
to the left will be equal to the tendency to jump to the right for all 
configurations for which T equals unity. These will be configurations 
for which k;, equals kj,, i.e., those which are equidistant from A and 
B. These configurations will be on the diagonal line shown in figure 
12. Since T equals unity for all configurations on the line, it may be 
called the line of indifferent transfer, or the “indifference line.” All 
configurations on this line are composed of two stimuli which are 
equal in size. For configurations lying above and to the left of this 
line, ki, < ki, , since these configurations are nearer to A than to B. 
It can readily be seen from equation (T-2) that in this case T > 1, 
that is, the tendency to give the A-response is stronger. Since the 
A-response in this illustration is a jump to the left, this means that 
the animal will jump to the left for all configurations above and to the 
left of the line in figure 12. Since these are configurations with the 
larger stimulus on the left the animal will also be jumping to the 
“larger of the two” stimuli. For configurations below and to the right 
of the line, the B-respense will be stronger. This response is a jump 
to the right in the present illustration and these configurations are 
ones in which the stimulus on the right side is larger. Thus it can be 
seen that jumping to the left for configurations more like A and to 
the right for configurations more like B may also be described as 
jumping to the “larger of the two” in every case. We have then the) 
familiar fact of “transposition” or “transfer on a relational basis” ; 
(3, 6, 7, 8, 14, 22, 26) predicted from a theory which did not assume 
the existence of relational response tendencies. ‘ 
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Figure 12. If the animal is trained to jump to the left in response to 
configuration A and to the right in response to configuration 
B, it will jump to the left in response to any configuration 
above and to the left of the diagonal line and to the right 
in response to any configuration below and to the right of 
the diagonal line. Responses to configurations on the diagon- 
al line will be made on a chance basis. 


Figure 13. If the animal is trained to jump to the left in response to 
configuration A and to the right in response to configuration 
B, it will respond to test configurations by jumping to the 
right in response to all configurations above and to the right 
of the diagonal line and to the left in response to all con- 
figurations below and to the left of the diagonal line. Re- 
sponses to configurations on the diagonal line will be made 
on a chance basis. 


Figure 14. Transfer in a four-configuration relative discrimination 
problem. If the animal has been trained to jump to the left, 
in response to configurations a and c, and to the right in 
response to configurations 6 and d, it will respond on test 
trials by jumping to the left to any configuration above and 
to the left of the diagonal line and to the right in response 
to any configuration below and to the right of the diagonal 
line. Responses to configurations on the diagonal line will 
be made on a chance basis. 


Figure 15. Transfer in a four-configuration total brightness discrimina- 
tion problem. 
If the animal has been trained to jump to the left in response 
to configurations a and ¢ and to the right in response to con- 
figurations 6 and d, it will respond on test trials by jump- 
ing to the left to any configuration below and to the left of 
the diagonal line and to the right in response to any configur- 
ation above and to the right of the diagonal line. Responses 
to configurations on the diagonal line will be made on a 
chance basis. 


Figure16. Transfer in a four-configuration absolute discrimination 
problem. If the animal has been trained to jump to the left 
in response to configurations a and c and to the right in re- 
sponse to configurations 6 and d, it will respond on test 
trials by jumping to the left in response to any configura- 
tion in the shaded areas and by jumping to the right in 
response to any configuration in the unshaded areas. Re- 
sponses to configurations on either of the two diagonal lines 
will be made on a chance basis. 
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Spence (26) has presented some evidence which indicates that 
under certain special conditions the animal may reverse the usual 
tendency to respond on a relational basis in transposition tests. 

Lashley (19), in a discussion of the basis of response of the rat, 

and Hebb (6, 7), in an experiment designed to test the rat’s innate 
visual organization, both reach the definite conclusion that the ani- 
mal’s response is based on relational characteristics of the two stimuli 
being discriminated. In both cases, the only evidence cited is the fact 
that in transfer tests the animal responds on a relational basis. Tans- 
fer on a relational basis is, in these articles, interpreted as evidence 
that learning was on a relational basis. It has now been demonstrated 
that relative transfer can be predicted from a theory based on a di- 
rectional definition of the response. Since this prediction is possible 
it is not legitimate to use the fact of relational transfer to support 
the assumption that the animal’s original learning was relational in 
nature. 
(2) It may be deduced that: If the animal has been trained to go to 
one side in response to configurations containing large stimuli, and to 
the other side in response to configurations containing small stimuli, 
it will respond on the basis of total size of stimuli in transfer tests. 

Assume that the animal was trained to go to the left for con- 
figuration A and to the right for configuration B, as diagramed in 
figure 13. Assume further that the logarithmid, assumption, on the 
basis of which the configurations in figure 13 were plotted, is correct. 
Then by a line of reasoning analogous to that followed above, it can 
be seen that the configurations for which the tendency to jump to the 
left is equal to the tendency to jump to the right will lie on the “‘indif- 
ference line” shown in figure 13. For configurations lying above this 
line the B-response will be stronger, and for configurations lying be- 
low this line the A-response will be stronger. In other words, for 
configurations containing large stimuli the B-response will be given, 
and for those containing small stimuli the A-response will be given. 

Experimental determination of where the indifference line runs 
in this case would give some important properties of the configura- 
tional field involved. For example, if the logarithmic scale suggested 
to conform with Weber’s law is correct, the indifference line should 
lie where the product of the two brightness values involved in the 
configuration is equal to the geometric mean of the two brightness 
values used in training. Variations from this line would indicate the 
type of modification of the logarithmic principle that should be made. 
(3) There are two deductions concerning the relationship between 
accuracy of transfer and amount of training: 

(a) It may be deduced that: For certain values of the parameters 
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the accuracy of transfer togny given configuration will increase with 


an increase in training. That is, if the configuration 7 is nearer to B 


than to A, the relative strength of the tendency to give the B-response 
to configuration i will increase with increased training on the A-B 
discrimination. Conversely, if the configuration is nearer to A than 
to B, the relative strength of the tendency to give the A-response to 
that configuration will increase with increased training on the A-B 
discrimination. 

That this deduction follows from the properties of equation (T-3) 
can be seen from the curves in figure 11 and from equation (T-4). 
Since the accuracy attained increases with increased training, a corol- 
ey of the foregoing deduction would be that: The accuracy of trans- 


curacy attained_in the original discrimination. Gulliksen (5) has 


- shown that the consistency of transfer by white rats in a size-bright- 


ness discrimination problem increases with an increase in the accu- 
racy attained before giving the transfer tests. Hovland (10), work- 
ing with a conditioned galvanic response to tonal stimuli, found that 
with increased training there was a gradual increase in the absolute 
magnitude of the galvanic response to tonal stimuli different from 
the training one. 

(b) It may also be deduced that: For other values of the parameters 
of the equation, the accuracy of transfer to any given configuration 
will first increase, and then decrease with increase in training. 

That this deduction follows can be seen from equation (T-5), the 
derivative of the transfer equation (T-3). 

Hovland (10) really found a steady increase; plotting the ratio 
of generalized to conditioned responses gave his. figure 2, an appar- 
ent decrease in generalization. 

In the case of both Gulliksen’s and Hovland’s data it would be 
necessary to know the values of the various parameters involved in 
the transfer equation (T-3), in order to determine whether or not 
these experiments verify the theory. 

By combining learning and transfer problems it is possible to get 
additional checks on the theory. For example, if a given configura- 
tion 7 is equidistant from a and b as shown by transfer experiments, 
then learning experiments should show that a problem requiring a 
discrimination between 7 and a is equal in difficulty to one requiring 
a discrimination between 7 and b. Difficulty, as used here, means the 
ultimate level of accuracy attained. That is, for animals with the 
same tary ability k, the secant of the angle between the asymp- 


totes (+ a should be the same for both experiments. 
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Similarly, if 7 is nearer to a than to b as shown by transfer ex- 
periments, then, in learning experiments the discrimination between 
i and a should be less accurate than the discrimination between 7 and 
b. For animals of the same learning ability, the ratio of the secants 


of the angles between the two asymptotes e+e should equal 
ib ia 


Kia 


. See equation (T-2). 
kip 





The Four-Configuration Case 
Extending formula (T-1) to the four-configuration case is rela- 
tively easy. 
dt, _ ut KigWa + kicWe — Ciny — Cidlla (T-6) 


dt, 5 — CigUa — CicUe + KinpWn + KiaWa 


The deductions regarding transfer will stand out more clearly if we 
assume 








We = Wy = We = Wa = W, 
Ug = Un = Uc = Ua = 4, 
a, = bi, and 

k= ¢;. 


In addition, substituting T for — gives: 
b 

_ + (Kia + Kic) w — (kin + Kia)u 

aj — (Kia + kicdu+ (ki, te kia) w ; 
This equation has the same properties as equation (T-1) if the ap- 
propriate change in coefficients is made. The foregoing deductions 
hold in an analogous manner. 

We will consider here only the indifference lines for the four-con- 

figuration case. It can be seen from equation (T-7) that T = 1 if 


Kia + Kic = kin + Kia. 


Let us draw the indifference lines for the three possible types of 
learning problems using four configurations. 


T (T-7) 





1. Relational Discrimination. 
In figure 14, the indifference line for the relational habit is shown. 
Since ki, = ki, and k;. = kia anywhere along the line, T equals unity. 
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This case is analogous to the corresponding two-configuration case al- 
ready discussed. If the animal has been trained to make one response 
to configurations a and c and the other response to configurations b 
and d, transfer to new configurations will be relational in character. 


2. Total Brightness Discrimination. 

If one response is demanded to configurations a and c in figure 
15, and the opposite response to configurations b and d, the training 
conditions demand a discrimination of total brightness. Anywhere 


along the line in figure 15 

kia = kin and kie = Kia. 
Transfer to any configuration on this line will be equal, or, in other 
words, the discrimination will break down. 


_ 8. Absolute Discrimination, 


If one response is demanded to configurations a and c in figure 
16 and the other response is demanded to configurations b and d, an 
absolute discrimination habit is formed. Assuming that the conditions 
for learning the problem are satisfied, namely 


e+ Keoe 


ee ae oe 1 
Ka + ad - 


(See Equation IV-19) 
then kia + kie > kin + Kia for configurations near @ and ¢, and the 
a-response will be given. For configurations near b or d the reverse is 
true. Under these training conditions, transfer will be on an absolute 
basis. 

With these training conditions, there will be two indifference 
lines or lines‘ of equal transfer. These lines are labeled 1 and 2 in 


figure 16. 
Along line 1, kig = kin and Kic = Kia. 
Along line 2, ki, = kia and kin = Kie. 
In either case ki, + kic = ki, + kia and the value of T= 1. 


The discrimination will break down if the animal is presented with 
configurations lying on either of these two lines. 


SUMMARY AND DISCUSSION 


We have selected the problem of visual discrimination by the 
white rat to illustrate the development of a rational learning theory 
which is applicable to any situation in which the learning subject is 
required to respond in one way to one class of configurations and in 
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another way to another class of configurations. In our example, we 
have defined the response as being a jump to the right or a jump to 
the left, and the stimulus as consisting of the entire configuration 
presented on a given trial. 

In addition to these basic definitions, we have assumed that: 

(1) each rewarded response increases the tendency to repeat that 
response upon later presentations of the configuration to which it was 
made and also, but to a lesser extent, upon later presentations of other 
similar configurations; (2) each punished response decreases the 
tendency to repeat that response upon later presentations of the con- 
figuration to which it was made and also, but to a lesser extent, upon 
later presentations of other similar configurations; (3) the slope of 
the cumulative error curve is equal to the ratio of the strengths of the 
tendencies to respond incorrectly and correctly. 

On the basis of these fundamental definitions and assumptions 
(and with the help of certain simplifying conditions described in the 
paper), a rational equation of the learning curve has been developed. 
This equation was shown to be a hyperbola passing through the origin. 
With appropriate changes in the parameters it was shown that this 
equation describes the course of learning when four configurations are 
used instead of two and that it predicts the nature of the transfer to 
new configurations. 

A number of deductions may be made rigorously from the equa- 
tion. We have presented the following ones: 

(1) The difficulty of a problem, as measured by the maximum level 
of accuracy attainable, is inversely related to the distance separating 
the two configurations to be discriminated. Direct experimental work 
on learning is not available to verify this deduction. It is in agreement 
with a large body of psychophysical data. 

(2) The ultimate level of accuracy attainable in an absolute discrimi- 
nation will be lower than that attainable in a relative discrimination 
involving the same four configurations. This deduction is in agree- 
ment with experimental findings. 

(3) The ultimate level of accuracy of discrimination in a four-con- 
figuration problem, as measured by the slope of the asymptote, is the 
same for the extreme cases of pairs of configurations far enough apart 
to be essentially independent and for pairs of configurations close 
enough together to be practically identical. No experimental evidence 
on this point has been reported. 

(4) If the animal has been trained to go to the larger (or smaller) 
of two stimuli, it will respond on a relational basis in transfer tests. 
A large mass of experimental data reported by a number of workers 
is in agreement with this deduction. 
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(5) If the animal has been trained to go to one side when presented 
with two large stimuli, and to the other side when presented with two 
small stimuli, it will respond to test configurations on the basis of 
total size of stimuli. No experimental evidence on this point has been 
reported. 

(6) Under some conditions, consistency of transfer to any given 
configuration will increase with each increase in the amount of train- 
ing preceding the transfer test. Under other conditions, the consis- 
tency of transfer will reach a maximum with an optimal amount of 
training and will then decrease. The first part of this deduction is in 
agreement with experimental findings; there is no evidence on the 
second part of the deduction. 

(7) Consistency of transfer will decrease as the distance between 
the training configurations and the test configurations increases, This 


. deduction is in agreement with experimental findings. 


(8) If training has established an absolute discrimination, transfer 
to new configurations will be on an absolute basis. No experimental 
evidence on this point has been reported. 

The preceding deductions have been made with specific reference 
to size discrimination. As mentioned in the Introduction, similar de- 
ductions may be made for discrimination of other characteristics, such 
as hue, saturation, and brightness. 

The experimental testing of these deductions provides the sound- 
est method of testing the plausibility of the theory on which they were 
based. We have indicated that there is experimental evidence in quali- 
tative agreement with some of the deductions. Some of the deductions 
are completely untested. All of them are untested quantitatively. 
When the stimulus configurations are accurately scaled, the theory 
leads to a number of quantitative deductions regarding the ease or 
difficulty of different problems, and the certainty of transfer to given 
new configurations. If no qualitative disagreement between theoretical 
deduction and experimental fact is found, it will become desirable to 
make these quantitative predictions and test them experimentally. 

In developing the theory we have applied it to the problems of 
learning and transfer of visual discriminations by the white rat. The 
specific definitions and postulates used were made with this type of 
learning in mind. But the theory is more general than the illustration. 
By appropriate changes in the definitions and in some of the postu- 
lates, a similar development could be made for learning of any type of 
material to which the concept of scaling can be applied. 

An equation has been presented for the two-configuration case 
and extended to the four-configuration case. It may be further ex- 
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tended to cover situations in which more than four configurations are 
involved —-~ the n-configuration case. 

Furthermcre, by extension, the theory can be applied to more 
complex situaticns. Some experimenters have used a technique in 
which three or more stimuli were presented on each trial. To handle 
such situations we must assume that the response possibilities are no 
longer limited to a jump to the right and a jump to the left, but in- 
clude other possible types of response, such as a jump to the middle. 
Corresponding to this increase in the number of response possibilities 
there is a probable increase in the dimensionality of the space in which 
the stimulus configurations are plotted. If, for example, three stimuli 
are presented on each trial, the configurations could be plotted in terms 
of three coordinates. To extend the formulation to handle form dis- 
crimination may involve a similar increase in dimensionality. The 
distance between configurations consisting of different forms could be 
measured, and this problem handled in essentially the same way as the 
size or brightness discrimination problem. The labor involved in 
handling the n-dimensional case would be greatly magnified, but other- 
wise the 2-dimensional case and the n-dimensional case are analogous. 

Extension to the n-configurational case in n-dimensional space 
would allow treatment of some problems which are not ordinarily 
classed as discrimination learning. It may be pointed out as an ex- 
ample that concept formation or the learning of a generalization may 
be thought of as a discrimination problem involving a large number of 
configurations to which the same response is demanded. The learning 
by a child of the meaning of the concepts “bread,” “spinach,” and 
“orange” certainly demands that he be able to discriminate between 
the different stimulus configurations appropriate to these concepts and 
that he be able to give the same response to a number of similar con- 
figurations. In the examples chosen it may be predicted that the 
distances separating these configurations are fairly large. Learning, 
consequently, should be easier and incorrect transfer to new configura- 
tions less likely than where smaller distances are involved, as for 
example, in learning to discriminate between the situations in which 
conceptual responses of “duck,” “chicken,” and “goose” are appro- 
priate. 

Reports are occasionally made of positive transfer from one sense 
mode or one kind of stimulus to another sense mode or kind of 
stimulus. Thus, if an animal has been trained to go to the brighter of 
two lights, it may respond positively to the larger of two circles or 
the louder of two tones. Transfer of this type can be predicted 
theoretically in exactly the same manner as transfer under the more 
usual conditions. If the distance between a given brightness configur- 





al 


) are 


nore 
e in 
ndle 
2 no 
, in- 
dle. 
ities 
1ich 
nuli 
rms 
dis- 
The 
| be 
the 

in 
\er- 
US. 
ace 
rily 
ex- 
1ay 
of 
ing 
ind 
en 
ind 
on- 
the 
ng, 
ra- 
‘or 
ich 
r0- 


se 
of 
of 
or 


re 
r- 





HAROLD GULLIKSEN AND DAEL L. WOLFLE 247 


ation and a given tone configuration is scaled, the amount of transfer 
from one to the other can be predicted. Generally, two configurations 
involving only brightness would be closer to each other than would one 
configuration involving brightness and another involving loudness. 
Consequently the transfer to a configuration made up of a different 
class of stimuli would be expected to be fairly small, but it might occur. 

It is believed by some writers that a discrimination is learned very 
quickly when the subject once hits upon the hypothesis which the 
experimenter intended should be learned. This sudden learning is 
sometimes called insight. Insight, in terms of the present discussion, 
would consist of a sudden shift in the distance separating two con- 
figurations. An emphasis upon some aspects of the configurations 
might make two of them seem very close together; an emphasis upon 
some other aspect might make their apparent distance much greater. 


_A sudden increase in the perceptual distance between two configura- 


tions would result in a sudden decrease in difficulty of the problem. 
This change would produce the insight type of learning curve. 

The sudden appearance of transfer is sometimes referred to as 
an illustration of insight. The ape, for example, may for a time fail 
to respond to the limb of a tree as a stick for reaching, and then sud- 
denly use it for this purpose. In terms of the present theory this 
transfer would be described as an increase in the psychological simi- 
larity between the limb on the tree and the stick on the floor. 

Either of the two foregoing illustrations of “insight” may be re- 
garded as a shift in the “psychological distance” between the two 
stimuli involved. It is obvious that these statements do not constitute 
an explanation of insight but merely a suggestion for experimental 
work. For example, it may be pointed out that a change in the “dis- 
tance” between two stimulus configurations cannot take place without 
a simultaneous change in the distances between all similar configura- 
tions. Thus, after observing any evidence of insight one would expect 
shifts in the ease of learning and transfer in numerous related prob- 
lems. On this point no experimental work has been reported and the 
possibility seems not to have been noted. 

Possible Changes in the Theoretical Formulation 

We have chosen certain definitions of the two basic concepts of 
stimulus-configuration and of response, and have made three assump- 
tions regarding the course of learning. These definitions and postu- 
lates were chosen because they seemed to predict more experimental 
facts and relationships upon a more parsimonious postulational basis 
than did any other set of definitions and postulates. Others could have 
been used, and it is possible that some of these might have led to a 
more general theory, to one with simpler equations, or to one which 
predicted more experimentally verifiable facts. 
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Consider, for example, the definitions of stimulus and response 
used by Spence (25, 26). He has assumed that at the beginning of 
training the rat has tendencies to respond directionally (to the right 
or left) and relatively (to the larger or smaller). The relational ten- 
dencies appear quite reasonable upon the basis of experimental obser- 
vation of the behavior of rats. Conversely, it seems unreasonable from 
an anthropomorphic point of view to deny that the rat discriminates 
the relative size or brightness of stimulus objects. Yet we have made 
that denial and defined the rat’s response directionally. We rejected 
Spence’s definition and adopted a directional one when we found it 
possible to predict relational transfer on the basis of directional habits. 
This prediction seemed scientifically more interesting than to assume 
a relative tendency at the beginning and then deduce relative transfer 
later. 

The fact that relative transfer can be predicted without assuming 
that the animal can discriminate relative differences initially shows 
clearly that the fact of relational transfer can no longer be used as 
evidence to support the assumption of relational learning. Such pre- 
diction suggests a method of ascertaining just what type of evidence 
is necessary to determine the basis of the rat’s discriminatory ability. 
If human beings can react directly to relative differences in a sense in 
which rats can not, attempts to verify our deductions regarding trans- 
fer should show different results for rats and human beings. These 
differences might indicate the type of evidence necessary to demon- 
strate an ability to react directly to relations. 

In plotting stimulus configurations we have represented each one 
by a single point, the two coordinates of which represent the same 
characteristic (such as size) of the two different stimuli which make 
up the configuration. This technique is more or less artificial in that 
it treats a characteristic of each of two stimuli which happen to be 
presented simultaneously as two dimensions. It may be that the theory 
and the associated scaling technique will apply better to a situation in 
which a single stimulus object may be described in terms of two char- 
acteristics. The discrimination of colors which vary simultaneously in 
brightness and saturation is one such situation. 

Our first two postulates assumed that the learning parameters, ¢ 
and k, are constant throughout the course of learning; that is, that 
each trial produces the same learning effect. Krechevsky (16) named 
this assumption “the continuity theory,” and devised an ingenious ex- 
periment to show that it did not apply during the initial trials. He 
suggested that in the first trials the rat is not learning the problem 
the experimenter had in mind and that consequently rewarding or 
punishing the rat for correct or incorrect choices has no effect on the 
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learning of that particular problem. He named this suggestion “the 
discontinuity theory.” The suggestion that the first trials in the dis- 
crimination experiment are peculiar and not a part of the same con- 
tinuum as the remaining ones has previously been made by Lashley 
(17) and also by Thurstone (29) and by Wiley and Wiley (32) with 
special reference to the treatment of these errors in rational learning 
equations. 

It is possible to interpret the early trials in a discrimination prob- 
lem on any one of several bases. The discontinuity theory of Krech- 
evsky is one possibility. Two other assumptions harmonize well with 
Krechevsky’s data. It may be assumed that the early trials constitute 
a period of adaptation to the experimental situation and that learning 
does not begin until some time after training has started. Or it may~ 
be assumed following Spence (25, 26), that the values of c and k vary 
with practice. 

Finally, it may be assumed, as we have done, that the values of 
cand k are constant throughout the course of training. The learning 
curve based on this assumption fits learning data well. The theory 
based upon it predicts a number of known experimental findings. 
Without it, the equations are quite complicated. Our assumption is 
undoubtedly an oversimplification of the real situation. But in the 
present state of knowledge of learning, the fact that a theory does 
predict a large number of items concerning learning outweighs in im- 
portance the preliminary oversimplification. 

Our third postulate was that the slope of the cumulative error 
curve equaled the ratio of the strengths of the tendencies to respond 
incorrectly and correctly. While using it gave a number of interesting 
developments, the equations turned out to be mathematically unwieldy 
in many places. This assumption was made following Thorndike’s 
suggestion (27) that such a relationship might exist. It has been used 
because no ready substitute has been found. Possibly the assumption 
that strength is a negatively accelerated function of the number of 
rewards and punishments could be combined with a simpler equation 
relating the slope of the cumulative error curve to the strengths of the 
responses involved. This combination might lead to a simpler mathe- 
matical development, but we have not as yet followed up the possi- 
bility. 

We realize that the definitions and postulates upon which the 
theory has been developed will be greeted with skepticism by some 
psychologists. We are aware that the theory will be criticized because 
a particular definition or postulate does not agree with the critic’s 
notion regarding the behavior of a learning animal. Criticism on such 
grounds is not crucial. When an experimental report is published, 
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other investigators who question the results are at liberty to point out 
experimental flaws in the method or to repeat the experiment. If the 
repetition fails to lead to verification of the original results, then the 
question of which experiment is “right” arises. Either, both, or neith- 
er may be; the test is usually not a crucial one. When a rational theory 
is published, other investigators who question the theory have before 
them an opportunity for a much more crucial test of its adequacy. The 
first method is to test the precise deductions drawn from the theory. 
If these deductions can be shown to be in conflict with experimental 
data, the theory is shown to be inadequate. The second method con- 
sists of developing another theory upon the basis of definitions and 
postulates which seem more logical to the critic. If the second theory 
can be shown to be more fruitful — to lead to the prediction of more 
experimentally known or discoverable relationships, then it has served 
to render the first theory inadequate. Both methods of attack are open 
to critics of this theory. 

Most of the work reported in the psychological literature con- 
sists of single experiments or groups of experiments centered about 
more or less isolated problems. A given problem is studied; certain 
results are obtained; and an interpretation is placed upon these re- 
sults. Someone else repeats the experiment under altered conditions; 
different results are obtained; and the interpretation is different. 
Eventually there comes a time when one has a number of groups of 
experimental results each with a more or less distinct interpretation. 
This situation is scientifically unsatisfying and so attempts are made 
to unify the facts with some basic theory. The ‘“‘identical-elements” 
theory of transfer of training or Krechevsky’s “hypothesis” (15) 
theory of discrimination learning illustrate the type of intuitive theor- 
ies which are usually developed when this stage is reached. 

The difficulty with intuitive theories lies in the difficulty of devis- 
ing crucial checks of their adequacy and the limited possibilities of 
making experimentally verifiable deductions from the theories. Un- 
satisfied with theories the acceptance of which is so largely a matter 
of belief and interpretation, some workers have attempted to unify the 
facts of a given field of experimental knowledge into a rigorous ration- 
al theory. Rational theories have a distinct advantage over both iso- 
lated experimentation and intuitional theories in that they lead to the 
deduction of very specific experiments which provide more crucial 
tests of the adequacy of the theory. The predictive power of such ra- 
tional theories is illustrated by the developments of Thurstone (28, 
29), Hull (11, 12), and of Rashevsky (23). Each of these develop- 
ments has opened up a number of new experimental problems; each 
has suggested experimental methods of testing the theory itself. 
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This theory has presented a number of precise deductions regard- 
ing learning and transfer which are capable of quantitative experi- 
mental proof or disproof. These experiments will lead to a more com- 
prehensive theory which will render the present one obsolete. In the 
meantime the present formulation has indicated an order and unity 
underlying the processes of both learning and transfer. 
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On the basis of previous mathematico-biophysical studies, a the- 
ory of visual perception is outlined. The theory is applied to the 
aesthetic values of polygonal patterns and found in fair agreement 
with experimental results. 


In a number of previous publications, as well as in our book 
“Mathematical Biophysics” (1), we have laid down the foundations 
for a physico-mathematical theory of neuro-physiological and psycho- 
logical phenomena. This paper is a continuation of the previous stud- 
ies. A full understanding of the following presupposes a familiarity 
with the third part of the book, which is referred to as MB with a 
roman figure denoting a chapter. 

While the present paper deals in a sense with a theory of the 
“Gestalt’’, different from the one discussed in MB, XXVII, it does not 
reject or contradict the older results. The mechanism of “Gestalt-in- 
variance” discussed in MB and studied further in a paper by A. 
Householder (2) may account for certain properties, such as percep- 
tion of wholeness, etc., while other properties may be due to a mech- 
anism discussed here. The two mechanisms, as will be readily seen, 
may coexist in the central nervous system. 


I 


In preceding publications (3, 4, 5) we have suggested, how 
through the mechanism of internal inhibition, a discrimination of ab- 
solute intensities of a stimulus may be obtained. We shall now de- 
velop the above suggestion somewhat more in detail, indicating a way 
of putting it on a quantitative mathematical basis. 

Let a chain of afferent nerve-fibers carry a peripheral excitation 
to a center C, in which the end-fiber of the chain gives off a large 
number of branches, b, all. of them forming synapses with neurones 
(ne) of higher order (Fig. 1). Let these neurones possess different 
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FIGURE 1 


thresholds h, distributed according to some distribution-function 
N(h), so that 


[Naan nit (1) 


where N is the total number of neurones (ne). 

For a given intensity S of the peripheral stimulus there corre- 
sponds a definite value E, of intensity of excitation in each branch b 
(MB, XXII). If the chain consists of only one fiber, then EF, is a lin- 
ear function of S. In general however E, is a more complicated func- 
tion of S, depending on the constants of the individual members of 
the chain 


E,=f(S). (2) 


The intensity of excitation EZ, of each neurone (ne) is given by 
(MB, XXII): 
Ene = B(PE,—h). (3) 


The total intensity of excitation of all neurones (ne) which have a 
threshold h < PE, is then given by 


E(h) = B(PE,—h)N(h). (4) 


If N(O) = O as should be expected from general considerations, then 
E(h) is zero for h = O and for h = PE, and is positive within this 
interval. Hence in that interval E(h) has at least one maximum. Let 
us first consider the case that it has only one maximum, h = hy, hn 
being a function of FE, . 

Let all neurones (ne) send fibers to a center C,, in such a way 
that all neurones (ne) with the same threshold h form a synapse with 
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the same neurone of third order (ne;) and let moreover each neurone 
(ne) send off a collateral, which excites inhibitory fibers (nz), lead- 
ing to all the synapses s; (MB, XXII). Consider for simplicity that 
all thresholds h; of ni are the same. In Fig. 1 only one fiber of each 
group of the same threshold is shown, for simplicity. The situation 
is formally the same as represented by equ. (74) of Chapt. XXII of 
MB., in which f,; = Const. Replacing the sum by an integral, we 
have for the value of (« — 7) at each synapse s; (h) 


(e—j) (h) = PE(h) —Q f ““TE(h) —hi]dh. (5) 


The integral at the right-hand side of (5) is a positive constant, en- 
tirely determined by P, E, h; and E(h). Denoting this constant by 
K , we have 

(e— 7) (h) = PE(h) —QK. (6) 


From what we have said about E(h) and from (6) it follows that 
(e — 7) (h) will be positive only for values of h , which lie in the in- 
terval (h,, hz) where h, and h, are the smaller and larger roots of 
the equation 


En) —2S = 0. (7) 
h, and h, > h, are both positive and lie in the interval (O, PE,) ; 
h, — h, decreases with increasing K. We also have 


hy < hn < he. (8) 


If K is large enough, then (e — j) at s; will be positive only for such 
values of h which lie in the neighborhood of h.,,, which correspond 
to the maximum of E(h). Therefore, although for a given E, all neu- 
rones (ne) are excited, only a small group of synapses 8; will be ex- 
cited. If by varying the intensity S of the peripheral stimulus, we 
vary E, , this will result in a variation of h,, and if for a new value 
E’, of E, , the corresponding h’,, will be sufficiently different from the 
h»,, then entirely different groups of synapses s; will be excited by 
the stimulus S’ than by S. Thus, to any intensity S of the peripheral 
stimulus corresponds the excitation of a definite group of synapses 
8;. If each of these synapses is connected to a conditioning circuit, 
described in a previous paper (5), then these synapses may be condi- 
tioned to a given reaction. Thus a stimulus of intensity S will pro- 
duce a reaction R through a group s; of synapses. A stimulus of a 
different intensity S’ will however not produce R , because it involves 
a totally different group s’; of synapses. Each intensity S of the same 
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stimulus has thus a representative individual group of synapses in 
the nerve-centers and therefore each intensity may be in a way con- 
sidered as a different stimulus-pattern. 

However things will happen in this fashion only if the difference 
between the intensities S and S’ is sufficiently large. If this is not the 
case, then h,, and h’,, differ very little, and the corresponding intervals 
(h,, he) and (h',, h':) will partially overlap (Fig 2). If now S is 





FIGURE 2 


conditioned to a reaction R, then S’ produces also R without being 
previously conditioned to it, through all the synapses, which corre- 
spond to the interval (h’, , h.) (Fig. 2). Since, because of equa. 
(4) and (7) h, and h., as well as h,» are functions of E, or, what 
amounts to the same, functions of S, therefore the minimum differ- 
ence AS = S — S’ for which the intervals (h, , he) and (h',, h'.) do 
not overlap at all, is itself a function of S. The determination of this 
function AS = U(S) is in principle a simple problem, which may 
however involve some complicated algebra. The function U(S) is de- 
termined by f(S) in equ. (2) and by N(h) in equ. (1). Several prob- 
lems suggest themselves at this stage. For instance we may ask the 
following question: How should f(S) and N(h) be chosen in order 
that AS = U(S) would have a prescribed form, say that of Fechner’s 
law? 

When (h,, he) and (h',, h’.) partially overlap, we may investi- 
gate what will be the relative intensity of RF , as produced by S’ com- 
pared to that produced by S. In the simplest case this relative inten- 
sity will be given by the ratio 7» of all neurones lying in the interval 
(',, he) (Fig. 2) to those lying in the interval (h,, h.). That is 


» N(h)dh 
R = n Sao aoe _s (9) 
f N(h)dh 
hy 





If we make more complicated assumptions about the possible inter- 
action of the neurones which lie on the efferent side of the synapses 
8; we shall obtain expressions different from (9). Here is another 
interesting group of problems. 

The case when E(h) has several maxima in the interval 
(O, PE,) is more complicated but is treated in a similar way. 
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II 


We have seen elsewhere (MB, XXVI), that when a fiber, charac- 
terized by 


MMos Dae (10) 
a; ~ b; 

forms a synapse with another fiber, then the latter becomes excited 
for a short time only when the intensity of excitation of the first 
fiber suddenly increases. If the first fiber is excited continuously with 
a constant intensity E', the second fiber is not excited. As we have 
seen in MB the intensity of excitation of the second fiber increases 
with increasing AE where AE is the amount of sudden increment of 
E. As can be readily seen from the discussion in MB, XXVI, the 
liminal AE’ which still produces an excitation of the second fiber is 
a linearly decreasing function of E. Since EZ is in general some func- 
tion E = f(S) of the intensity of the peripheral stimulus, which in- 
volves the first fiber, therefore this liminal AE will be in general more 
or less complicated function of S , depending on the shape of f(S). 

If the second fiber leads to a center C,, discussed in section I, 
then to each intensity of excitation of the second fiber there will corre- 
spond a definite group of excited neurones (ne,). And since a defi- 
nite intensity E* of excitation of the second fiber corresponds to a 
definite relative sudden increment AE /E of the intensity of excitation 
of the first fiber, while this relative increment corresponds itself to a 
definite relative increment AS/S of the peripheral stimulus, there- 
fore to every value of AS/S there will correspond a definite individual 
group of neurones (ne;,). As has been shown in MB, XXVI, a simi- 
lar mechanism is obtained for a sudden decrease of E', respectively 
S, that is for a negative AS/S. 

We thus have a situation, in which a sudden change of intensity 
of a constant external stimulus results in a short excitation of a 
definite group of central neurones, different groups corresponding 
to different amounts of change. 

Combining this result with that of section I, we may say, that 
to each absolute value of intensity of a peripheral stimulus corre- 
sponds the excitation of a definite center consisting of a definite group 
of individual neurones ne, ; and to each value of relative change of 
the intensity of peripheral excitation there also corresponds the exci- 
tation of a similar definite center. The finite though very large num- 
ber of neurones in the brain causes an overlapping of some of these 
centers and thus causes the perception of continuity of the possible 
values of intensity of peripheral excitation. 
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III 


We shall now introduce some neuro-physiological assumptions, 
which may appear to be rather questionable. We are introducing them 
here not because we consider them to be any more likely than other 
possible assumptions, but rather in a tentative way, as a sort of work- 
ing hypothesis, in order to show how by means of such or similar 
assumptions we may develop a physico-mathematical theory of some 
such phenomena of visual perception, the discussion of which hither- 
to frequently remained on a purely qualitative level. In the last sec- 
tion of this paper we shall discuss some possible modifications of these 
hypotheses, modifications which, however, may leave the fundamental 
formal results unaltered. 

When we look at a segment of a straight line, we fix alternately 
our attention on different points of this line. This results in move- 
ment of our eyes along that segment back and forth. Experiments on 
eye movements (6) show that there is no simple relation between 
the shape of a figure contemplated and the path of the eye movement. 
Actual following of a rectangle gives a rather irregular path for 
the eye movement (7). Inasmuch however as the eye muscles are 


FIGURE 3 


innervated by several centers in a rather complicated fashion, the fol- 
lowing hypothesis will not necessarily be at variance with observa- 
tions. Suppose that there is a group of brain centers, which innervate 
the eye-muscles in such a way as to make the eye follow exactly a 
contour which is projected on the retina. Each eye muscle is however 
innervated by a number of other centers, which produce different 
movements, superimposed on the above. Thus the actual movement 
of the eye may have no apparent relation to the shape of the contour 
contemplated, yet it contains as its part such movements, the path of 
which is an exact replica of the geometrical pattern looked at. 
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On Fig. (3) C, represents schematically the center, which would 
produce such exact movements of the eye, through the muscle M, 
while Cy stands for all other centers which may impart to M move- 
ments that have nothing to do with the shape of the contour. The 
character of the movement which would be produced by M under the 
influence of C, alone, depends on the character of excitation of C, (in- 
tensity, duration). If through a synapse s a fiber a is excited which 
leads to a sensory center Cp (Fig. 3), then to each definite type of 
excitation of C, , or in other words, to each definite type of movement 
produced by M under the influence of C, alone, corresponds a definite 
intensity and duration of excitation of Cp. If M would be innervated 
by C, only, then the same type of excitation in Cp could be obtained 
through proprioceptive fibers, which would lead from M to C,, and 
which would. be excited by the contraction of M. In such a case the 
proprioceptive impulses would have a definite relation to the shape 
of the contemplated pattern. Actually, due to the other centers Cy, 
the proprioceptive impulses, coming from M have no relation to the 
shape of the pattern. But the impulses coming through s over a (Fig. 
3) do have such a relation. In other words according to the scheme 
represented on Fig. (3) things will happen at Cp in such a way, as if 
the movements produced by M would exactly correspond to the shape 
of the contour contemplated, and as if these movements would send 
off proprioceptive impulses directly to Cp. 

The above considerations therefore enable us to speak, for sim- 
plicity, of the eye actually following the contemplated contour, and of 
proprioceptive impulses corresponding to such movements being sent 
to some sensory centers. Though actually things are much more com- 
plex, by means of the above scheme we may interpret our schema- 
tized statements in terms of a more exact neuro-physiological pic- 
ture. Thus when in the following we say: “The contraction of the 
eye muscle sets up proprioceptive impulses, which behave so and so”, 
we do not mean it literally but actually imagine a scheme like the one 
represented on Fig. 3. 

Neurological mechanisms that would produce such a movement 
can be readily suggested and shall be discussed in a separate paper. 
For the time being let us make for simplicity and as a special 
purely theoretical case, the assumption that this movement of the 
eyes goes on with a constant velocity v. Let a line segment AB be 
placed in a vertical plane, and let its angle with the vertical be @. 
(Fig. 4). Neglecting the possible role of the oblique eye-muscles, the 
movement of the eyes in the direction AB involves a contraction of 
the Rectus superior of both eyes, the right Rectus lateralis externus 
and the left Rectus lateralis internus and a relaxation of both Recti 
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' FIGurE 4 

inferiors, the right lateralis internus and left lateralis externus (6). 

Assuming again for simplicity that the coordinated movements 
of the left lat. int. and the right lat. ext. are controlled from one com- 
mon center, and that the same holds for the right lat. int. and left 
lat. ext., we may greatly simplify our considerations by consider- 
ing only the movements of one eye, for instance of the right one. 
Then the movement in the direction AB (Fig. 4) involves a contrac- 
tion of Rectus sup. and of lat. ext. and a relaxation of Rect. inf. and 
lat. int. which may be written in an abbreviated form, thus: 


AB: rs+ ;le+ ;r— ;li—; (11) 
The movement in the direction BA involves: 
BA: rs— ;le— ;rit ;li+; (12) 


The changes in muscular contraction of the four muscles produce pro- 
prioceptive impulses of different intensities and if the proprioceptive 
fibers lead to centers C, , discussed in sect. I, then any given intensity 
of contraction will produce excitation in a definite individual group 
of neurones (ve.,). Again for simplicity we shall consider that only 
positive contractions produce proprioceptive impulses, a relaxation 
being characterized by a lack of such impulses. Let the intensity of 
these impulses be proportional to the velocity of the contraction of 
the muscles. Then, if v is the velocity of the eye movement, the ve- 
locities of contraction of rs and le in (11) are respectively: 


vcos@ and vsing 
and the corresponding intensities of the proprioceptive impulses are: 


E,,=avcos@; E,.=avsing (13) 


a being a coefficient of proportionality. 

The excitation E,, excites a definite individual group of neurones 
(ne,) in a center, which we shall call V (for vertical) ; while FE). ex- 
cites a group of neurones in a center E (horizontal). 








5). 
nts 
m- 
aft 
r- 
1e, 
UC 
nd 





N. RASHEVSKY 261 


The movement of the eye in the direction BA produces the fol- 
lowing excitation: 


E,,;=avcos@; E); =av sine (14) 


and causes the excitation of two other groups of neurones (7e;), one 
in V, one in E.. If the eye moves up and down indefinitely altogether 
4 groups of neurones are excited. If we consider now another seg- 
ment of a straight line, which is characterized by a different 6 = 6’, 
again 4 groups of neurones (neé;) will be excited, but these 4 groups 
will be different from the former 4, because E,, , Ei. , Ey; , Ei; are dif- 
ferent. Some of the new groups may partly overlap with the old ones 
only when 6 — 6 = A@é is very small. Thus to every position of a 
straight line segment in a vertical plane corresponds the excitation 
of four distinct individual centers, 2 in H and 2 in V. 

Considering now the segment A’B’ (Fig. 4), symmetrical to AB 
with respect to the vertical OO’ , we shall find by similar considera- 
tions, that its scanning with the eye in the direction A’B’ gives 


E',, =avcosd=E,,; E'}; =avsind=Ej; (15) 
and in the direction B’A’ 
E’.,=avcosd—E,;; E'1.=went=E,,. (16) 


That is AB and A’B’ will excite identical groups of neurones (ne;) in 
E and V. This difficulty can be avoided in several ways, of which we 
shall consider here only one. 

Consider a center C, of permanent excitation, such as discussed 
in MB, XXIV, which excites inhibiting fibers, that lead to synapses s , 
which connect the afferent fibers from li, le and vi to neurones of 
higher order (Fig. 5). Let the contraction of rs excite the fiber (1), 
which leads to the centers V. Let (1) send off collaterals (2), which 


rs & le ré 


( 





FIGURE 5 
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excite inhibitory fibers (3), which in their turn inhibit the synapses s' 
and thus disinhibit s . In that case a contraction of rs , that is a move- 
ment of the eye upwards, if followed by a lateral movement produces 
an excitation both in V and EF. But a downward movement of the 
eye, produced by a contraction of ri and a relaxation of 7s , does not 
result in any excitation in either V or E'. Of course other brain cen- 
ters may be excited by such movement, by means of other fiber con- 
nections. But at present we are interested only in V and E.. We now 
have for the movement along AB (Fig. 4). 


AB : rs + ;le+ ;ri— ;li— (17) 
and for BA: 
BA: rs— ;le— ;n—: h— (18) 
For A’B’ we have 
A'B': rs+ ; 4+ ;ri— 3 le— (19) 
and 
B'A' : rs— ;4— 3 ri— sle— (20) 


From (17) and (18) it follows that scanning of AB in both directions 
results in: 
E,,=avcos6@; E,;.= av sine (21) 


while scanning A’B’ in both directions results in: 
E',,=avcosé; E',;.=av sine (22) 


Both AB and A’B’ involve rs in the same amount and therefore excite 
the same group of neurones in V. But, while AB involves le , A'B’ 
involves li. Hence the groups in E, due to following of the two lines 
in both directions, will be different. Every segment of the straight 
line excites one group of neurones in FE and one in V , the groups dif- 
fering for different 6’s. Lines symmetrical with respect to either a 
vertical or horizontal line have a common group in V. Parallel seg- 
ments produce identical excitation. 

The proprioceptive excitations, due to the contemplation of any 
segment of a straight line may thus, through V and E, be conditioned 
to a response R. A different segment will in general not produce F, 
because it involves different groups of neurones in V and E. A seg- 
ment symmetrical to the original one with respect to a vertical line 
may however produce a weaker FR through the group of neurones in 
V , which it has in common with the orizinal segment (MB, XXV). 








es 3 
OVe- 
Ces 


not 
cen- 
0n- 
now 


18) 


19) 


20) 


ons 


12) 





N. RASHEVSKY 263 


(e- .cl 
FIGURE 6 
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Now consider two segments AB and BC (Fig. 6), forming an 
angle at B. Following this figure with the eye results in the excita- 
tion of two groups of neurones in V and of two groups in E. If how- 
ever the proprioceptive fibers, leading from rs, li, and le to V andE , 
send off collaterals, which are characterized by (10) and which each 
leads to a center A*, A* will become excited every time the eye passes 
the angle B , because of a sudden change in the intensity of excitation 
E,, and E,. (or of E,, and E,; for a differently oriented angle). 

To each value of the angle B corresponds a different intensity of 
excitation in A* and if A* is connected to a center A , of the kind dis- 
cussed in section I, then to each value of the angle B there will corre- 
spond a, definite group of neurones (”e;,) in A. We thus have a cen- 
ter for the perception of angles. 

Making different hypotheses concerning the physical character- 
istics and constants of all centers involved, we may derive various 
quantitative relations, which may suggest experimental studies. As 
an illustration we shall discuss here only one example. Let A*, be 
the center, which is excited by a sudden change in E,, = av cos 6 and 
let the properties of this center be such that the liminal change AE, , 
which produces the excitation of A*,, is a constant independent on 
E,,. Since E,, decreases with increasing 6, that constant must be 
negative. Hence: 

















AE,, = —K. (23) 
Then the liminal change Aé, which is still perceivable, is given by 
_ @e e dé 
A6 coat dE +. AE ‘+s 7? —K dE,, . (24) 
Since 
6 = arc cos Ev, , (25) 
av 
we find: 
sii a Ex. K _ K Ev, ; 
av Evs V ay? = E 28 
J wnt azv? 


or expressing E,, again through 6 : 
Aé = K cot 0. (26) 


' dé 
This equation obviously does not hold for 6 = O. since the qe =’ 
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and (24) cannot be used. Expanding cos 6 for very small 6’s, we 


have 
cosd=1— =. (27) 


Hence 


E,, = av (1 —=) (28) 


and since for 6= 0, E,, = av , Aé corresponding to AE,, = K is given 
by: ‘ 


av ed =av—K; 


Ao = V2K. (29) 


In the neighborhood of 6 = 0, Aé is given by (29) and is constant. 
For larger @’s it is given by (26). This simple theory requires that 


we should easier perceive a small kink in a horizontal line (@ = 5) 


than in a vertical. Modifications of the assumptions about AE,, as 
well as of other details of the picture, lead to different formulae for 
Aé@. Consideration of the variation of both E,, and E;; (or E,,) leads 
to an expression for Aé, which has a minimum for 6 = 45°. A num- 
ber of mathematical problems are thus suggested, which, when worked 
out and compared with available experimental data, may throw val- 
uable light on our interpretation of the structure of the nerve centers. 

Finally we may consider that not only the speeds of the contrac- 
tion of rs, li and le control the excitation of some centers, but that 
speeial fibers are provided, in which the intensity of excitation is pro- 
portional to the amplitude of the muscular movements. In this fash- 
ion, by similar considerations as above, we shall arrive at a picture, 
in which a particular group of nerones (ne;) is excited in a center L, 
the group being characteristic for the length of the segment of 
straight line considered. 


IV 

If a polygonal contour, consisting of n sides and n angles, is pre- 
sented to the subject then it follows from the foregoing, that as this 
contour is followed by the eye, in general n groups of neurones will 
be excited in V, » groups in H, n groups in A and n groups in L. 
The intensities of excitation of the different groups, even belonging 
to the same center, such as H , or V , will be in general different. Two 
groups of neurones (ve,) corresponding to two different distinct in- 
tensities E’ , will not only be distinct, but will in general contain dif- 
ferent numbers of neurones. If the polygon possesses some symmetry 
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properties, which result for instance in some of the angles being 
equal, this reduces the number of distinct groups in A. If there are 
m equal sides, then they all excite the same group of neurones (7e;) 
in L, etc. As the contour is followed by the eye, the excitation of each 
group comes and goes, discontinuously, occurring only when the at- 
tention is concentrated on the particular element (segment), angle, 
length. If out of m elements, m are identical, the group (”é;) corre- 
sponding to those m elements, k, will be excited on the average per 
unit time m times more often, than those groups (”é€;), which corre- 
spond to elements that have no other identical ones. Hence the aver- 
age intensity of excitation of such a common group, corresponding to 
m identical elements k , will be m times the excitation due to a single 
element /. 

We may now consider a scheme, discussed in MB, XXII, in which 
an excitatory fiber leads from neurones of each center H, V, A and 
L to neurones of higher corresponding centers H’, V’, A’ and L’. This 
excitatory fiber branches off inhibitory fibers, leading to the neurones 
of all other centers. We may ask for the total excitation corresponding 
to a given polygonal contour. If this total excitation is transmitted to 
the pleasure-center (MB, XXX) then its intensity may be considered 
as a measure of the pleasantness or of the aesthetic value of a given 
contour. The problem is perfectly definite and can be treated as in MB, 
provided the intensities of excitation of all the groups (ve;) in H, 
V,A and L are given, in other words provided the distribution fune- 
tion N(h) [equa. (1)] for different centers, as well as the functions 
given by equa. (2), are specified. For different choices of those func- 
tions the problem will be of different degree of mathematical difficulty. 
A particularly simple case is obtained, when we consider a very spe- 
cial case, that the intervals (h,, h2) contain always the same number 
of neurones, and that approximately the intensity of excitation of 
any such group, falling in any interval (h, , h.) is constant. This case 
is rather unlikely to occur exactly, but we shall consider it here as a 
possible approximation. In that case two different elements, for in- 
stance angles, are characterized by two distinct groups (ne;), having 
however the same total intensity of excitation FE, , when the element 
is perceived. To m identical elements corresponds one group with an 
average intensity mE,, since the group is stimulated m times more 
frequently. 

We shall apply the above considerations to a group of polygons, 
for which G. Birkhoff (8) has calculated quantitative aesthetic values 
on the basis of some general considerations of complexity and order 
and for which those values have been measured by R. C. Davis by the 
rank order method (9) (Fig. 7). Let the intensity EF, of excitation 
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of a group (ne;) be taken as (0.1). The intensity of inhibition, which 
is approximately proportional to E, (MB, XXII) is taken to be 0.001 
per unit of E,. Then for No. 50 (Fig. 8) we have the following: 





FIGURE 8 


There are altogether 8 segments of horizontal lines, which do not 
give any excitation at V: 3 —4, 5—6,8—9,10—11,15— 16, 
17 — 18, 20 — 21 and 22 — 23 ; they all cause the excitation of 
one group (ne;) in H, with an intensity of 0.8. The sides 1 — 2, 
6 — 7, 138 — 14 and 18 — 19 are all parallel and therefore identical 
in their H and V groups. Hence in the H center they give one group 
with E = 0.4. Similarly 1 — 24, 19 — 20, 12 — 13 and 7 — 8 give in 
the H center one group with E, = 0.4. In the V center the sides 2 — 3, 
4—5,9—10,11— 12, 14— 15, 16 — 17, 21 — 22 and 23 — 24 
give one group with E, = 0.8 ; while 1 —2,6—7,7—8,12— 18, 
13 — 14,18 — 19, 19 — 20 and 24 — 1 give another group in V with 
E = 0.8. In the A center we have one group with E = 1.2, correspond- 
ing to the right angles 3, 4, 5, 9, 10,11, 15, 16, 17, 21, 22 and 23, a group 
with EF = 0.2 for 1 and 13, another group with E = 0.2 for 7 and 19, 
one group with EF = 0.4 for 2, 12, 14 and 24, and one group with 
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E = 0.4 for 6, 8, 18 and 20. In the L center we have one group with 
E = 0.8 for the 8 equal sides 2 — 3, 4 — 5, 9 — 10, 11 — 12, 14 — 15, 
16 — 17, 21 — 22 and 23 — 24, another group with FE = 0.8 for the 
8 equal horizontal sides and a third group with E = 0.8 for the 8 equal 
inclined sides. Altogether we have the following scheme: 


H 
1 group of E = 0.8 
$ “* “ F=04 


V 
2 “e 6c E = 0.8 

ich A 
01 7 * * Pw 
DCs NSO 
2 * * §F=064 

L 
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The first group of H receives inhibitory fibers from all other groups, 
that is from altogether 12 groups. The corresponding intensities of 
inhibition are 0.04, 0.04, 0.008, 0.008, 0.012, 0.002, 0.002, 0.904, 0.004, 
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0.008, 0.008 and 0.008, altogether 0.072. Hence the net excitation of 
the first group in V is 0.8 — 0.072 = 0.728. Each of the two other 
groups in H receives again inhibitory fibers from 12 other groups, 
and the net excitation of the group is thus found in a similar way to 
be 0.324. Calculating thus the net excitations for each group and 
adding all of them up we obtain for the total value of excitation 7.04. 
In a similar way the total intensity of excitations of the groups were 
computed for the other 10 polygons of Fig. 7 and the results are 
plotted on Fig. 9 (Full light line). The broken and alternate lines of 
Fig. 9 represent the experimental rank order values of R. C. Davis 
for two different groups of students. The dotted line represents values 
calculated by Birkhoff. Inasmuch as our values give the actual inten- 
sities of central excitation, measured in arbitrary units, while Davis’ 
values give only the rank order, in which a certain arbitrariness en- 
ters in the assignments of weights, we cannot expect a close numer- 
ical agreement between our values and those of Davis. The general 
trend however agrees, with the exception of No. 20. 

We notice that our theory gives too low values for No. 20 and 
No. 60. The fact that No. 60 has actually a much higher value than 
No. 10, although both are septagons with approximately similar sym- 
metry properties, suggests that the V-groups, which correspond to 
smaller @ , are more strongly excited than those corresponding to larg- 
er 6’s. The assumption of unequal excitation of the different groups 
would make our calculations rather cumbersome. Leaving such a more 
rigorous study for another paper, we shall approximately take into ac- 
count a possible preference for vertical directions by multiplying the 
values of total intensity of excitation as calculated, by the ratio of the 
heights to widths of the corresponding polygons.: This is obviously a 
crude method, holding only for not too large ratios. The results are 
shown on Fig. 9, heavy line. The general agreement is somewhat im- 
proved, at the expense of No. 80, which is too much lowered. In view 
of the very crude approximations made, these results should com- 
mand our attention. Further refinement and generalization of the 
theory, along lines indicated above, is likely to bring further im- 
provements. 

If we do not make the rather crude assumption, that all the cen- 
ters are excited with equal intensity, then obviously the final results 
will depend on the assumption about the intensity of excitations of 
the different centers. These intensities of excitation may vary from 
individual to individual, thus accounting for the differences in tastes 
(MB, XXIX). On the average these intensities may be approximately 
equal, which would account for the fair agreement of the present 
theory with observations made on larger groups of people. 
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V 

The influence of various symmetries, that is the appearance of 
several identical elements, may be easily analysed by considerations 
similar to those given in Chap. XXII of MB. If we have for instance 
n equally externally excited but different elements, then each of the 
corresponding groups (ne;,) receives an inhibition from (n — 1) other 
groups. If m of these elements are identical, then we have only 
n—m-t+ 1 groups. The total applied excitation remains the same; in 
the first case it was nE , now it is (n — m)E + mE. But the group 
corresponding to the m identical elements receives inhibitory fibers 
from n — m other groups. Thus the total inhibition is decreased, but 
the net excitation increased. If we take as a measure of complexity 
the total number of elements in the pattern, then various types of 
symmetry have the effect of reducing the “effective” complexity; but 
this reduction occurs rather by a process of subtraction than by a 
process of division, as in Birkhoff’s case. It must also be remarked 
that even when all elements are different, the total excitation in our 
case does not monotonically decrease with the complexity n. Equa- 
tion (1) of Chapt. XXIX of MB gives us the intensity of excitatory 
process at one of the n synapses, corresponding to n different elements 
with equal intensity of excitation. The total intensity of excitation is 
obtained by multiplying that equation by n, which gives 


E'tor, = (KiS — K,)n — (KS 4+ K,n(n—1). (30) 
This expression is zero for n = 0 and has a maximum for 


. ye 2, +-Y 
ii 2(K.S + Kz) 


after which it decreases to zero and becomes negative. We should 
therefore expect that in a rather simple pattern, consisting of a very 
few elements, too much symmetry will be rather a disadvantage from 
the aesthetic point of view, for in that case the total excitation in- 
creases with increasing complexity. But for very complex patterns, 
where the complexity is greater than the optimal n*, some symmetry 
is pleasant, as reducing the complexity and bringing it closer to the 
optimal. In the general case of unequally excited elements, these re- 
lations become much more complicated. 

If a subject or animal is conditioned to a certain polygonal figure 
and then another one is presented to him, which has some elements n 
common with the first one, the intensity of the response to the second 
figure can be calculated by the method used in Chapt. XXIX of MB. 
Thus a quantitative theory of discrimination of patterns. may be de- 
veloped. Two patterns will be the more similar, the more common 


* 
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groups (ne;) in various centers they have. If all of the neurones in A 
send off fibers to a center A, , so that A, is always excited, whenever 
any angle no matter of what size is presented, then A, is a center that 
responds to “angularity” in general. An animal trained to choose a 
square, when a square and a circle are presented, will choose the more 
angular figure, when two other but different figures are presented. 
This provides a basis for the understanding of some interesting re- 
sults of H. Kliiver (10) on the equivalence or nonequivalence of cer- 
tain pairs of geometrical figures in the behaviour of monkeys. A de- 
tailed discussion of this problem must be made the subject of a sep- 
arate paper. 

The mechanism discussed here suggests a definite way for the 
generalization of the theory, including the aesthetic measure to cur- 
vilinear figures. If we again consider a fiber of the kind discussed in 
section II, but consider instead of a sudden change of E a gradually 
increasing EF , then we shall find by using the fundamental equations 
developed in MB, XXII that to any given variation of E' with respect 
to time there will correspond a definite intensity of excitation of a 
definite duration. When a curve is followed by the eye with a definite 
velocity, then E,, , H:; and E;, are not constants, but are known func- 
tions of time, determined by the shape of the curve. Therefore we 
may calculate the corresponding intensity of the excitation in a center 
C., to which appropriate fibers are leading. We may then calculate 
the minimum perceptible curvature and other interesting things. We 
may also ask, what particular shape of curve gives the maximum ex- 
citation at C. (cf. MB, XXII). 


VI 

Experiments by Edmund Jacobson (11) indicate that thinking 
of a vertical object, while not causing actual muscular contractions, 
yet sets up very weak action currents in the eye muscles, such as 
would correspond to a vertical movement of the eye. Thinking of 
horizontal objects gives currents corresponding to horizontal move- 
ments. This suggests that also proprioceptive impulses of the kind 
discussed in section III may be set forth without the muscles perform- 
ing the actual movements. It has been observed that complete paraly- 
sis of the eye muscles does not affect recognition of shapes. This may 
be due to the taking up of the functions of the eye muscles by other 
muscles of the body, which may imperceptibly move as a whole, or it 
may involve a mechanism of gestalt-recognition, discussed previously 
(MB, XXVII). 

Finally, it may be shown, that a mechanism based on our funda- 
mental equations developed in MB can be conceived, in which, when 
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a chain of neurones is excited along a contour line in a brain center, 
a wave of excitation is set up, moving along that chain with constant 
velocity. The vertical and horizontal component of the velocity of 
that wave may then formally play the same role as the vertical and 
horizontal movements of the eye. Such mechanisms will be discussed 
separately. If such a mechanism is at work, then the central wave 
may continue to travel even for some time after the visual object is 
removed from the field of vision. 

The main thing is that we have here a picture which provides for 
the analysis of complex patterns, and for the recognition of its ele- 
ments, and gives also a basis for a quantitative exact theory. Most 
likely this mechanism is operating simultaneously with the one dis- 
cussed before (MB, XXVII). The latter provides for recognition of 
gross features and perception of “wholeness”, the former for analysis 
of details. 
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CONDITIONING CIRCUITS 


A. S. HOUSEHOLDER 
The University of Chicago 


A circuit of mutually exciting neurones capable of approaching 
and maintaining a state of permanent excitation has been described 
by Rashevsky as a possible mechanism for the production of condi- 
tioned reflexes. The mode of approach of such a circuit to the steady 
state is examined in some detail under slightly more general assump- 
tions, and an estimate is made of the order of magnitude of the time 
required to reach this state. 


Rashevsky has proposed a simple neural mechanism for condi- 
tioning and has described qualitatively some aspects of its behavior 
[3, chap. XXV; 2]. The mechanism consists only of two nerve fi- 
bers, each exciting the other, having one synapse in common with a 
synapse of a chain leading from the conditioned stimulus to the ef- 
fector organs, and reached also by a nerve leading from the uncondi- 
tioned receptor (Fig. 1). A priori, initiation of excitation in one of 
the nerves of such a pair could lead to one of three possible results. 
Most obviously, after a few exchanges the excitation might die out. 


P 


4 
FIGURE I 


Or, the excitation might increase with each exchange until it becomes 
injurious. Or, finally, it might build up, and asymptotically approach 
some limiting value for each nerve, where a state of permanent exci- 
tation is reached. Naturally it is the last case that is of interest, and 
we wish here to discuss in greater detail the circumstances and the 
manner in which such an asymptotic state may be approached. 
Rashevsky assumes that a stimulus of intensity S applied to a 
nerve and exceeding a threshold value h , produces a state of excita- 
tion in the nerve, this excitation having an intensity E which is a 
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funciion of S—h. It is assumed that this function EZ (S—h) has the 
properties 

(a) E(S—h) is properly monotonically increasing ; 

(b) E'(S—A) is properly monotonically decreasing ; 

(c) (0) =0; 

(d) E(w) < om. 

Property (d) signifies that E has a finite asymptotic value. It is no- 
where employed and may be omitted, except for the non-essential 
equations (7) and the paragraph which follows. 

The result of the excitation of the nerve is the production at its 
end of a substance «, capable of acting as a stimulus S for any nerve 
leading from the synapse at which « is produced. It is assumed that 
the rate of production of « is proportional to the intensity E' of exci- 


tation of the excited nerve, but that « is dissipated or destroyed at a 
rate proportional to its own concentration. Thus* 


de/di =alE(S—h) —e], (1) 
where the factor a is a property of the substance <. It is introduced 
into the function EF only for convenience. By a change of time units, 

t=at, (2) 


the equation (1) becomes* 
de/dt=E(S—h) —E. (3) 
Strictly, when a stimulus S is applied, the production of « could 
not start instantly, but only after the nerve impulse has had time to 
traverse the nerve, so that S and « must be computed at different 
times. We shall assume, however, that this propagation time is neg- 
ligible. Hence if two fibers are combined to form a conditioning cir- 
cuit, each ¢ becomes an S for the other nerve, and we have [2, 3]: 
de,/dt = E,(&—he) —&,, 
(4) 
d e,/d t = E,(e, —h,) —&. 


We have supposed that the first nerve is stimulated by «, and pro- 


duces &2. 
Rashevsky proposes for the function EF the form 


EK = = [1 — €-29(8-h)] , (5) 


“For analytical convenience, Rashevsky’s proportionality factor A is ab- 
sorbed into the function E 
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where J, @ and a are characteristic of the nerve. However, the im- 
portant qualitative features of the solutions of (4) are consequences 
only of assumptions (a) to (d) imposed upon the functions EF’. Con- 
sequently it seems desirable to study first the more general case, after 
which we shall apply the results to the special case to see what further 
properties appear. 


I. GENERAL CASE 


1. The Equilibrium Points. From the form of the functions E, 
it is evident that the curves C, and C, (Fig. 2) defined by 


Ci: €, = E,(e—h), 
(6) 
C2: & =E,(e,—h,), 


intersect in two points in the first quadrant, or not at all. We assume 
these points of intersection, P and Q, to be real and distinct, with P 
nearest the origin. Let P and Q have the coordinates (p:, p.), and 
(41, G2) respectively. 

From (d) we see that C, and C, have asymptotes 


€, = E,(0), 
(7) 
E2 = E,(0), 


respectively. Then P and Q lie within the rectangle formed by the 
two asymptotes and the two axes. Coordinates of these points are 
easily calculated. Thus, taking the e,—axis horizontal, through the 
intersection of this axis with C, draw a vertical line to C, ; from this 
point draw a horizontal line to C, ; then a vertical toC,; ..... 
These sequences of abscissas and ordinates approach p, and p.. Start- 
ing from an asymptote instead of an axis one obtains q, and q. to as 
close an approximation as may be desired. 

Evidently at P the slope of C, exceeds that of C,, while at Q the 
slope of C, exceeds that of C,. Hence 


E’,(p, — h,) EB", (p2— he) > 1 > E's(Q1 — Ii) B'2(G2— re). (8) 


The concave side of the curve C, is the region of positive «1, 
and the concave side of C, the region of positive «’,. Hence we note 
that paths (solution curves) drawn in the «-plane through points in 
the vicinity of Q tend toward Q, while those in the neighborhood of 
P , at least through points between the curves C, and C., are directed 
away from P. Moreover, P and Q are both equilibrium points since 
there both derivatives vanish. We may infer, then, that P is a point 
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of unstable, Q of stable equilibrium. For a more complete analysis 
we make some transformations. 


Let 
Y= &—D, Yo = 2 — Pr. (9) 
Then 
Y'1 = G2(Y2) —4, 
(10) 
Y'2=Q1 (1) —Y¥, 
where 
Pi(y:) =E, ("1 + pi— h,) —p., 
(11) 


P2(Y2) = E2(Y2 + D2— he) — Di. 


The functions gm have the properties (a) to (d). Because of (a) we 
may write 


9: (9:) = Bry: [1—Bs mH Yi(ys)], B>O0, 


(12) 
P2(Y2) = Bs Y2[1 — Bs Ye Y2(y2)] , fb. >0, 
where 
Y,(0) = Y.(0) = 1. (13) 
Then because of (b) 
B>90, B>090, (14} 
while (8) gives 
Bi Bo > 1. (15) 
From (a) and the fact that p(0) = 0, we have 
1 9 F, >, t.— fig F, > 6. (16) 
Again from (a) and (b) 
gi>0, g2>0, 
(17) 
y'1<0, y’2 <0. 
A similar transformation 
a4=&—qh, Ro = 2 — Ae (18} 
gives 
21 = y2(%2) —%, 
(19) 


2's = yi(%) —&, 


with the functions 
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y; (21) = E,(2,+4¢.—h) —&, 


Wo (22) = E, (2. + de— he) — 41, 
having the properties (a) to (d). Again we may set 
y, (21) =y7*, 2,[1 — ys 2: Z,(2:)] ’ m>0O, 
Wo (Ze) = ys Zo[1 — ya 22 Zo (2%) ] , v>O0, 
where 
Z,(0) = Z.(0) = 1, 
and 


¥3 > 0 ’ 4 > °. 
Finally, as above, 


1>1%, 


1— 32,2, >0, 1— y¥,2,2.>0, 
and 


yi>O0, yo>O, 
yi <0, yr <0. 
Now equations (10) can be written 


Y's =— 4+ Bo Yo — Bo Bs Ye Yo, 


Y2= BP Y1 — Y2 — P, Bs y?, 
and the characteristic equation 
—l1—Jj B. 
= (1+ 14)? — P; B% =0, 
| Bi mn ham A | 
formed from the linear terms, has the roots 


4=—1+4,i>0, A=—1— 6,8.< 0. 


(20) 


(21) 


(22) 
(23) 


(24) 
(25) 


(26) 


(27) 


(28) 


(29) 


Then known theorems on differential equations [1, pp. 51, 52, 34, 19] 


show the following (Fig. 2): 


(i) There is a single path P,* from above and a single path P.* 
from below, each approaching P as t becomes positively infinite. At P 


these paths both have the tangent 
B, Ys + Bo Yo =0. 


(30) 


(ii) There is a single path P,- from the left and a single path 
P.- from the right, each (receding from P, i.e.) approaching P as t 
becomes negatively infinite. At P these paths both have the tangent 
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Bi Y¥1 — Bo Y2 = 0. (31) 


(iii) Draw a circle with radius arbitrarily small about any 
point on P.- as center, and. choose any point on P,* (or P.*). Thena 
circle can be drawn having its center at this point, and a radius so 
small that every path cutting this circle on the right cuts also the 
first circle. 

Equations (19) can be written 


2, = — 2, + ys 22 — 2 4 Ze 22, 
(32) 
2. = v7. & — &2— y71 V3 &4 2; , 
whose linear terms have the characteristic equation 
— a 2 
MP | = (1+ )?@— 7 ye = 0, (33) 
} Fe.) 
with the roots 
m4 =—1+ yy» <0, fe =—1— yi v2 < 0. (34) 


Hence [1, pp. 50-51]: 

(iv) Through every point within some circle about Q as a cen- 
ter, there passes a path approaching Q as t becomes infinite. It i3 
therefore apparent that through every point above and to the right of 
the paths P+, the path approaches Q as t becomes infinite. 

The intersection of one of the paths P+ with the corresponding 
e-axis determines the “threshold of the circuit,” h* [2, p. 208]. For 
let ¢,, say, be the intercept of P.* with the ¢,-axis. Then @, is the 
smallest value to which e, can be raised while «. remains zero in order 
that the circuit shall remain in an excited state. This is immediately 
evident from (iv). The analytic solution for these paths offers, there- 
fore, a problem of some interest, but it is reserved for a later paper. 

Let us next transform to polar coordinates at P by the substitu- 
tion 


Yi: =e cos (p+), Y2=osin (y+), (35) 
where w satisfies 
sin w = £i/V 1+ 62, cosa = fo/VB a+ B.. (36) 


Then 
eo =y.,cos(yp+o)+y2sin(y+o), 


ey =—y.sin(y+o) + y'2cos(p-+ a) ; 
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fi — Big — Ve hg +- 








cos ( = Nt tao ' 
sei nas VF + Bs 

. b+ be—%pig?+--:- 
n( )= ——_— 

ae VF Pe 


Hence 


o = (6:6:—1) e+-:-, 
p' = Bs B'2(Bs Bs — Bo Bs) (6: + 62)? op —2f Boy+-:. 


For these equations the characteristic roots are of opposite signs: 
bif2a—1>0, —28,p.<0. 
The solution approaching 9 = g = 0 as t becomes negatively infinite 
has, in the limit, 
dp/d 9 =6; B*2(B: Bs — Bo Bs) /[ {3 Bs Bo— 1) (61 + B'2)*?] . (38) 


Interchanging indices, if necessary, it is no restriction to suppose that 


B, Bs < Bo Bs. (39) 


Then the path P-. departs from P with decreasing g , and hence with 
downward concavity. 
If we make again the substitutions (35) but with w defined by 


sin w = —f,/V + p., cos w = B./\V/ B*, + Bo (40) 


then we have 
f= G+ 2) 04+--~, 
(41) 


y = — 8: B2(B: Bs + Bo Bs) (B21 + B22) 0 +28: Bo p+t--. 
In the limit 


dgy/de = B, B2(B: Bs + Bo Bs)/[ (3 Bi Bo + 1) (87; + 2) */?] ‘ ’ 
42 


Hence the paths P* always have their concavity towards the point Q 
for large ¢. 

(v) With assumption (39) the paths P- are concave downward 
at P , while in all cases the paths P* have here their concavity directed 
toward the point Q. 

Similar transformations may be made at Q, and the results of 
these can be obtained from the foregoing by merely changing y to z 
and # to y. The equations so obtained from (41) give the polar 


(37) 
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equations of paths through Q having at Q the tangent 
V1 Zit yee =O0. (43) 


The characteristic equation of the linear term has the roots —1 
— y: y2 and + 2 y; y2, which are of opposite signs. Hence, there is a 
unique pair of paths approaching Q with the tangent (43). Then the 
equation corresponding to (42) shows that at Q these have their con- 
cavity away from P. 

Equations corresponding to (87) give the polar equations of 
paths approaching Q having at Q the tangent 


V1 21 — oe = 0. (44) 


Here the characteristic roots are —1 + y, y2. and —2 y, y2, both nega- 
tive. Hence there are infinitely many paths approaching Q along this 
tangent. But every path approaching Q does so along one tangent or 
the other [1, p. 23] . Hence 

(vi) There isa unique path from above and a unique path from 
below approacing Q with the tangent (43) as t becomes positively in- 
finite. At Q these paths are concave away from P. Through every 
point not on one of these paths, but lying above and to the right of 
the paths P+, there passes a single path which approaches Q as t be- 
comes positively infinite, which is tangent at Q to the line (44), and 
which is concave upward or downward when 


("1 Ys — V2 v3) /(3 Dc ae meas 1) 
is positive or negative. 


The last statement is justified by considering (38) with y re- 
placing /. 

2. The Inflection Loci. For more details concerning the form 
of the paths we consider the points at which their concavity changes, 
that is, the locus of their inflection points. This is obtained from the 
determinant 

(Ys 5 Yo) = YY" 2 — "3 Yn = 1 "gp — "5 Bg = I (21, a). (45) 

Differentiating (10) we have 

Y'1=P'2V2—Y1, (46) 


Yy"2 = Q's Yi — Ye. 
Hence 
[= P1V2—Q2Y?2’. (47) 


Since g’, and q’, are always positive, J has the real factors 
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I, = Vor Y's se V_2Y'2 ’ 
(48) 
I,= V_: Yi t+ Vo Y's. 
Both curves J, = 0 and J, = 0 pass through P and Q, but nowhere 


else intersect either C, or C.. 
Near P we have 


V¢o:1= 6, (1 — Bs Yit--), 


(49) 
V_2 = bo(1— fi. Y2+--), 
so that 
T,= (6: 62 +1) (—A + Bo Y2) + (6: Bo+1) (B; Bs y*s 
— B. Bs y*2) + B; Bo(B; Bs — Be Bs) Y1 Yo+-::, 
(50) 


T, = (8; Be —1) (B: ¥1 + Bo Yo) — (Bs Bo — 1) (Bi Bs Ys 


+ Bo Bs Y72) — Bi Bo(B: Ba + Bo Bs) Ys Yo +e. 


Hence at P the paths P* are tangent to J, and the paths P- are tan- 
gent to J,. Likewise at Q, I, has the tangent (43) and is therefore 
tangent to a unique pair of paths, while all other paths approaching 
Q are tangent at Q to IJ,. The curve J, passes from P to Q between 
C, and C,. 

Consider a short arc of J, containing P, and another short arc 
containing Q. Call these the P-branch and the Q-branch, respectively. 
Then these branches of J, , together with I, , divide a suitable region 
of the plane about P and Q into 6 sub-regions. Above the curve I, the 
function J, is positive; below it is negative. Between the branches of 
I, , the function I, is positive; outside it is negative. In the three re- 
gions where the signs are alike, J is positive, and the direction of con- 
cavity of the path is to the left of the advancing tangent; elsewhere 
it is to the right. 

We shall now show that every path, in crossing J, , does so in 
passing from its positive to its negative side. To do this we calculate 


0 I/o ee P'1 y's? —2 Q's (y': =f Q's Y's) ’ 
(52) 
01/0 Y2 = —o"2yY+ 29 (y's +9 y's), 
giving the components of a vector normal to the curves J, and I. and 


extending into the region in which the signs are alike. Along either 
curve 


Yi,0T/dy, + y20T/0y = p'1y¥F2—o'2y2. (53) 
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Now ¢”; and @”, are always negative, while on J, the quantities y’, 
and y’, have opposite signs. In particular, on the upper P-branch, 
y; > 0, and hence the expression (53) is negative. Hence at each 
point of the upper P-branch, the advancing tangent of the trajectory 
makes an obtuse angle with the normal vector (52). This normal 
vector here points to the positive side of J. , and hence the path passes 
from the positive side to the negative side. With due regard for signs 
an analogous argument applies at the other arcs of J,. Otherwise ex- 
pressed, all paths which intersect J, do so in passing, with increasing 
t, from the region between its two branches into the region outside. 
It is then geometrically evident that the P-branch, however far ex- 
tended, nowhere joins the Q-branch. Referring to (v) and (vi) we 
conclude: 

(vii) The locus of inflection points of the paths is a curve hav- 
ing three branches: a branch I, = 0 containing P and Q and lying 
between C, and C, ; two branches I, = 0, one through P and one 
through Q. The paths Ft are tangent to the P-branch of I, at P , and 
are elsewhere on the positive side of I,. There is a unique pair of 
paths tangent, at Q, to the Q-branch of I., and lying elsewhere on 
the positive side of I,. Through every point between these pairs of 
paths, there passes a path which approaches Q from the left tan- 
gentially to I, as t becomes infinite. Through every point between the 
P-branch and. the pair P+ there passes a path which intersects the 
P-branch and goes off to infinity. Through every point between the 
other pair and the Q-branch there passes a path which intersects the 
Q-branch and then approaches Q from the right tangentially to I, . 

It is of interest to find the direction of concavity of J, and J, at 
P , which can be done by setting 


Y¥.=o0f2, Y2= of: 
in J,, and 

Y¥i=0f2, Ye = — of, 
in7J,. This gives 


I, — o? By Bo (Be Bs + Bx Bs) ++: 
I, = Q By Bo (Bobs — B, Bs) +e, 


From (39) it follows that the tangent to J, is on the positive side so 
that J, is concave downward, as is also7,. At Q, like substitutions in 
y show that I, is concave upward, but the concavity of J, depends 
upon the sign of yo ys — y1 Ya 

3. Approach to Steady State. Consider a pair of equations 
analogous to (32) of the form 
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Z's = —2,+ y72%—9Ii> 
(54) 


Zo = 771 1 — 22 — G2, 
where g, and g, are functions of t. Let the initial conditions be 
(0) = £,, 22(0) =f. (55) 
It can be verified that these equations have the solutions 


1 
g, = i [et (y, City t.—J;) Se erat (y, C1 — Pe fo —Jz)], 





(56) 
2 = ‘2 Lem! (y. fi + 2 S2— Ji) — emt (y, 01 — yo So —Je)]; 
where ee 
I= [eC nls) + 72028)] ds, 
(57) 
1= fem a(s) ros] as, 
and uw, and uw are given by (34). 
If we set 
91 = 77270 2Z2, G2 = 7717325 hi, (58) 


then the g’s in (54) are indirectly functions of t through the unknown 
2’s, equations (54) are identified with (32), and (56) is a set of in- 
tegral equations equivalent to (32) and the initial conditions (55). 
Explicity, we have 


1 
_ 241 Lem! (ys la + yo So) + em? (1 li — yo Se) ] 
— VW y2 {em fer Ys 22 Zot 1 321 Zi) ds 
0 


t 
+. ent f CHS (y, yy 2?o Zo — V1 73 271 Z1) AS}, 
0 


(59) 


22 > o Lem? (y, Ci+ ye Co) — eft (y1 01 — yo C2) ] 


t 
— % y, {em f C-8 (9 ¥4 Zo Zo t+ 91 ys 21 Zi) ds 
0 
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ot 
— elizt | CMS (y2 Y4 Zo Zo — 1 3 21 Z1) A s}. 
0 


This pair of integral equations is non-linear, but nevertheless for 
at least those paths which converge to Q an iterative procedure for 
effecting a numerical solution is possible just as in the linear case. 
Take the terms outside the integral signs as first approximations 
2(t) to z(t): 


2, (t) = zn [em* (y, Ci + v2 fo) + erst (y, C1 — Ye Co) ], 
(60) 


1 
2, (t) = 2 [erst (1 an -- V2 Co) eae ebet (y, Cy aa Ce)] . 


If these values are used to replace the z, and z. under the integral 
sign in (59), we obtain second approximations 2,(¢) , 2,(t). 
The quadratures can be made exactly or by any suitable scheme for 
numerical quadrature. Substituting again with these results we get 
2), (t), 2@.(t), and so on sequentially. For arcs near to P it may be 
preferable to use, instead of (59), the analogous equations in y. 

It is desirable in particular to estimate the rate at which Q is 
approached, or the time necessary to come within an assigned dis- 
tance from Q. To make such an estimate we first obtain from (56) 


V1 Zit yo 2 = et (y, fi + yo lo—Ji(t)). (61) 


The left member of this equation is (y?, + y*.)'/? times the distance 
from the line (43) to the point (z,, 2.) on the path, and the first two 
terms in parentheses give the same multiple of the distance from this 
line to the initial point. For initial points not too far removed from 
the exceptional path P.- running from P for t = — o to Q for 
t=-+ ow, it is evident geometrically that the distance from this line 
may be used as a fairly satisfactory measure of the distance from Q. 
As t becomes infinite the left member of (61) vanishes, and hence so 


does the right member. 


Set 
V1 Zit eo = 2(y1 Ci+ ve Co). (62) 
Then from (61) 
1 1 J,(t) 
t= —] — — log (1 — ———_—__). 63 
My baie My og ( V1 Cy — V2 F ( ) 


Neglecting J, amounts to finding the time on the assumption that the 
approximation 2, , 2, is exact. However, a better value for ¢ can 
be found by evaluating J, (to a sufficiently large t) with z, and 
2, replacing z, and z.. 
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For initial points relatively close to the line (48) a like discussion 
can be made using y; 2: — y2 2. 


II. THE CASE OF EXPONENTIAL DEVELOPMENT 


For Rashevsky’s case of exponential development of excitation 
a preliminary transformation yields a special symmetry of the equa- 
tions. We start with the equations 





i, = A I, [1 — E-9202(E2-ha) ] —E,> 
6. 
(64) 
és - A # [1 san 2) @-2101(€1-ha) ] =——€,% 





1 


after changing the time unit as in (2) to eliminate the factor a. 
Then the substitutions 


Y= 4,46, & ’ Le = Ae Oo &2, 
¢, = Al, az 62/6, ’ Co = Al, a; 6:/62, (65) 
b, = C, E2101 ‘ b? = C, E.2202h2 ' 


reduce the system to the form 


, 


2’, =¢c,—b,e"—27,, 


(66) 
z', = ¢,—b,e"%—2. 
The transformation analogous to (9) gives 
y= f.(1 —@") —N", 
(67) 
Y's aad (1 aaa e™) —Ye2, 
where 
f?, = b,e" =¢,— Dre, BP. = by 0? = C2 — D1. (68) 
The transformation analogous to (18) gives 
2’, = y*,(1—e*) —2Z,, 
(69) 
2.=7,(1—e*) —z, 
where 
vy, =be%=—Q, Ye= b, e-@ = C, — Qh. (70) 
Evidently 
Bs = bs = ¥s = ¥e = 1/2. (71) 


Hence convention (39) becomes 











Aas 


as 


~~. i Ze) | 6a 
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Bi < fe. (72) 
It then follows that 
Yi < Ye. (73) 
To show this, let 
%=—PDi, T2 = A2—DPre- 
Then 
= fe" = B—r, 7. = B?, e-? = B, —7,. 


If % > 71, then 
771 — yo = (81 — Be) +(%1—) < 0 
and (73) is true. If, on the other hand, 7. < 7,, then 
1/yPo= CF" B?,/Pp <1 


and (73) still holds. It follows, therefore, that I, is concave down- 
ward both at P and at Q. 

It can be shown further, in this case, that paths cross I, only 
from above, never from below. First we have from (48) 


L= Bb, e-M/2 yy’, — Bo ew/2 y',. 
Then from (53) 
Y's I,, + y'2 I,, = — Peny' r+ Bev y',8 
= B, et? y'? y'(B, e-/? — B, e-v/?) 


along J,. This quantity is negative and the statement is true at least 
near P and on the right because of (72) and the fact that all factors 
outside the parentheses are positive. The quantity in parentheses can 
change signs on J, only at a point at which 


B, e-"1/2 Bb, e-"2/2 — 0) 
B, e-"1/2 y's see B. e-¥2/2 Y's =_ 0 


simultaneously. The curves defined by these equations intersect only 
on the curve 
yi—y2=0, 


8, — B?, — BP. e+ f,e%—y,+y,=0, 


or 
or 
B, — 8, = Yi — Y2 , 
and hence at a point for which 
Yi > Y2- 
On the other hand, at such a point 
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eur-w/2 = B./B, <1, 
Y¥.< Ye. 


Thus no such point exists and the sign changes only at P or Q. 

Since all paths have the same direction of concavity at Q. and 
some approach Q from below /,, it follows that all paths [ except 
those two which have the tangent (43)] approach from below. This 
completes the proof of the above statement. Hence every point above 
I, and to the right of the paths P* lies either on the path approaching 
Q from above along (43), or else it lies on a path which crosses /, 
once and then approaches Q tangentially to /,. 

The functions g of (54) and (57) are 


7; = y?. (e*? — z, — 1) = % yo Pemmsss, 
(74) 
92 = y?, (e-*: — 2, — 1) = \% v7, B27, —eee, 


For negative z, and z, both are everywhere positive, and hence J, is 
positive. Hence neglecting J, in (63) gives always too small a value 
for t, when ¢, and ¢, are both negative. This is apt to be the case of 
greatest interest since the initiation of excitation will normally take 
place by raising one of the e’s above the threshold of the circuit while 
the other « is initially zero. Thus one of the ¢’s is certainly negative. 
The other ¢, if positive, is necessarily small and the corresponding z 
becomes and remains negative after a short interval t. 

Some of the results obtained above are illustrated in Fig. 2. 
The crossing of J, from above, illustrated in the figure, has been 
proved only for the exponential case. The peculiar shape of the Q- 
branch of J, has not been discussed, but it is qualitatively correct, at 
least for the exponential case. The general concavity of J is not de- 
cided. In all other respects, the essential features are as indicated 
even in the general case. 

This figure is to be compared with Rashevsky’s Fig. 2 of 2, and 
Fig. 61 of 3, ch. XXIV. These figures, drawn from general considera- 
tions and without detailed analysis, are validated in the main. We 
note only the following points: Rashevsky does not give the J-curves, 
and the behavior near the exceptional paths to Q is left undecided; 
also the curve A B is our pair of paths P*, and should be concave in 
the opposite direction. 


Ill. THE TIME LAG 


If the propagation time for the two neurones is not negligible, 
the form of the problem is altered considerably. We shall merely for- 
mulate the problem without attempting to develop the properties of 
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the solutions. Let t be the time that elapses between the application 
of the stimulus to either neurone and the consequent development of 
e. We suppose, for simplicity, that this is the same for both neurones 
cr the circuit. We suppose that the time units have been chosen so 
that the parameter a = 1. Suppose, for example, that initially «, = 0, 
and that during the time-interval from t = 0 to t = 2 a, & is devel- 
oped by an afferent neurone and has at any time ¢ of this interval the 
value «,(t). Then from the time t = 0 to the time ¢t = a, there is 
still no production of ¢,, because of the propagation lag. But from 
t = a on the development of e, proceeds and at any time ¢ the devel- 
opment is determined by the value of «, at the time t — a. Hence, in 
place of the first of equations (4) we have 


é',(t) = A E,[e.(t—a) —h.] — a(t). " (75) 
For simplicity let us set 
no(t) = A E.[e.(t — a) —he], 
(76) 
yi(t) =A E,[e,(t—a) —h]. 
Then, as we have seen, 7.(t) is known on the interval from t = a to 


t= 38a. Hence from t = 0 tot = a, ¢,(t) = 0, while from t = a 
tot = 3a, e,(t) is given as the solution of 


e’, (t) = no (t) —e«,(t), 


(77) 
€, (a) = 0 . 
This equation is easily solved: 
t 
a =e [etm(s) ds, wihdde, (78) 


But then 7, (t) becomes known on the interval from t = 2atot=4a, 
and if the afferent neurone is no longer excited (if it is we simply add 
in the e, which it produces) we find in like manner 


t 
&. = €*{e 6, (2a) + fence) ds},2a<t<4a. (79) 
2a 


Proceeding sequentially we obtain a set of recursion formulas giving 
€, on the intervals (2n —1)a < t < (2n + 1)a, and «, on the inter- 
vals2na<t< (2n+2)a.. 
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THE RELATION BETWEEN THE INTENSITY OF EXCITATION 
AND THE NUMBER OF NEURONES TRAVERSED 


H. D. LANDAHL 
The University of Chicago 


In previous formulations the assumption has been made that 
the intensity of excitation or inhibition varies with distance. In this 
paper the simple case of a series of identical neurones is treated. 
The development shows five possible relations. In future theoretical 
formulations it is no longer necessary to postulate arbitrary rela- 
tions in case any of the derived forms can be used. 


In chapter xxii of his book “Mathematical Biophysics” N. Rash- 
evsky (1) discusses how the intensity of nerve excitation, traveling 





ae So 4 4 | ee & 
ps 4 i 
FIGURE 1 


along a chain of neurones, varies in a definite manner along the chain, 
the variation depending upon the constants of the neurones involved. 
The following development is confined to the case of a series of iden- 
tical neurones over which a nerve impulse may pass. (Fig. 1). We 
consider neurones of the general excitatory type (1), producing both 
eand 7. The excitatory and inhibitory factors e and j are postulated 
as being produced at a synapse and their difference, if greater than 
the synaptic threshold, serves as the stimulus for the higher order 
nerve. If the chain is an inhibitory one, the last neurone of the series 
is inhibitory (1). 

Let a constant stimulus S be applied to the afferent end of the 
first neurone of the series. The intensity of excitation in the first 
neurone is then given by 


E,=al,(S—h), S>h, (1) 


where S is the strength of the stimulus, h the threshold, J, the inten- 
sity of each individual nerve impulse, and a a constant. By hypothe- 
sis we have a, I, , and h the same for each nerve fiber. 

Now for the k‘* neurone the differential equations for the exci- 
tation and inhibition factors are 
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de 
a = A.E;, — AeEx > (2) 
dj, . 
“dt = B.Ex — bejx » (3) 
where we also have 
E. — a Ty (&-1 — Jra—h). (4) 
The solutions of equations (2) and (3) fora constant EH}, are 
AE: 
a =" (1—e'), (5) 
and 
: B.E 
teal ile a edi (6) 


where we have e, = 7, = 0 for t = 0. For convenience the origin of 
the time axis is moved forward with each k. It is necessary to take 
this into account if the time delays are of interest. 

Let us now consider a case in which the stimulus S has been ap- 
plied sufficiently long so that we may neglect the exponential terms. 
Then we have 


; A. . Be 
je — J. = =; Ex - (7) 
Define P, by the relation* 
A, B. 
Oa Sag We (8) 
Then using equations (1), (4) and (7) we have the following rela- 
tions 


— <— al(S—h) =P,S—P,h, 


bs A. B. A. B. 
Ja —_? al, [ —e aI,(S—h) —h] 


= P2S — Ph —P,h, 
_ he ee a 
es-— js — ee he alo ae ty a I,?(S h) 


A. B. 
a ee iy al,h—h] 


*P, should not be confused with the symbol P used by Rashevsky (1). 
P was defined by the expression in parentheses in equation (8), or P = P,/al,. 








(2) 


(3) 


(4) 
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3 — js = PS — Ph — Ph — P,h, 


én — Jn = PS — P, (Pi + Py? +--+ DA, (9) 
and replacing the geometric series by its sum, we have 
eT Se eS ee for P, 1 (10) 


1—P, 1—P, 


or 
&s— jn = S—nh, forP,=1. (11) 


In Figure 2 the curves for the various cases are shown. For 











t Pm 
" n ae 


FIGURE 2 
P, < 1 the function decreases exponentially from the initial value S 
and approaches — i sm (Curve I). For P, = 1 we have a straight 
en 1 

line with a negative slope, —h (Curve II). For P, > 1 we have three 
possibilities. If P, > S/(S—h), en — jn is an exponentially increas- 
ing function of n (Curve III,). If P, = S/(S—h), then en —jn = S,a 
constant (Curve III,). If P; < S/(S—h), en — jn decreases monoto- 
nically with n and becomes negatively infinite (Curve III.). 

It may be that the number of neurones between two points is pro- 
portional to the distance. Equation (10) or (11) then gives the rela- 
tion between the net excitation at the efferent end of a nerve fiber as 
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it varies with the distance of the nerve fiber from an arbitrary origin. 
The cases I and III, may be written 


S(z) = Ce e™itd, (12) 
or if we may neglect d, 
S(z) =&,e'™" ., (18) 


An equation of the above form was assumed by Householder and Ame- 

lotti (2). Here we see that it is an approximation to equation (12) 

or two cases of equation (10) and that it is obtained directly from 

postulating a series of similar neurones. The case III, is also a spe- 

cial case of equation (13) where m = 0. This simplest case was 

assumed by Rashevsky in treating the theory of delayed reflexes (3). 
Let us now solve the equation 


E,—Iya=h (14) 


where 7’ is the value of » at which the excitation wave stops. Substi- 
tuting equations (10) and (11) into (14) we have, for P, ~ 1 





Pr.” S + Pp, h— P,"* S —_ h, (15) 
or 
—" | h / 
"fog °° SUP) FPR " 
and similarly for P, = 1 
,_S—h 
iagiens “ans (17) 


In equation (16), ”’ is real unless [S(1—P,) 4+- P,h] < 0 (Curves 
III,, III). Differentiating with respect to S, we have from (16) and 
(17), for real values of n’, 


dn’ P,—1 1 


dS , for P 18 
dS log P, [Sd—P) LP ° for P, 1 (18) 





since 
[S(i—P,) + Pk] >0, 
and 
dn’ 1 = 
ag =F >9> for P,=1 (19) 


so that regardless of the values of P, , n’ increases with S. 

Let S, and S. be the magnitudes of two stimuli incident upon the 
afferent ends of two similar chains of identical neurones. Then it fol- 
lows from equations (10) and (11) that the difference in stimulus 
value at the end of the chain is given by 


8 — §’, aad Ein — Jin — (€2n — Jon) = y(S, — S,) (20) 
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where y is a constant. Since generally we know only the relative 
values of the stimuli, equation (20) permits the use of S, — S, instead 
of S', — S’. ; that is, we may use the intensity difference at any point 
along the two parallel series. 

In the case of an excitation wave passing over a series of iden- 
tical neurones, the excitation may increase or decrease with distance, 
or remain constant, depending on the choice of parameters. Evidently 
the assumption that any or all of the parameters vary with distance 
will lead to still more complex relations than those derived above. 
However, such complications need not be made at present since the 
simplest case already gives sufficient possibilities. 

This investigation has been made possible through a grant from 
the Rockefeller Foundation to the University of Chicago. 
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A NOTE ON DWYER: THE DETERMINATION OF THE FACTOR 
LOADINGS OF A GIVEN TEST 


If a factorial analysis of a battery of tests has been performed, 
and the centroid or principal axis factor loadings and the primary 
trait loadings* for each test have been determined, it is sometimes 
desired to discover the loadings of additional tests on the same pri- 
mary traits without the necessity of factoring a new correlation ma- 
trix consisting of all the inter-correlations of the original battery of 
tests plus the additional tests. 

In a recent article, Dwyer} has presented a method for determin- 
ing the factor loadings of additional tests. The method as presented 
by Dwyer is primarily adapted for use with the results of a centroid 
or principal axis analysis. This note is merely a restatement of Dwy- 
er’s method in more condensed formulation, and an extension of it 
to the determination of primary trait loadings. 


a. Let there be: 
n tests, with general subscript 7, of known loadings on r 
factors with general subscript m; let the factor loading of 
the 7" test on the m' factor be @;; and let Finj = |@mj| . 


b. Let there be s additional tests, with general subscript f¢, 
whose loadings @,,; are sought, given the correlation co- 
efficient r;; between each of the s tests, ¢, and each of the 
n tests, 7; let Fm; be the matrix of factor loadings dm;; and 
let R;, be the matrix of correlation coefficients, 7;;. 


Referring to page 174 of Dwyer’s presentation, and rewriting 
his second set of equations in the above notation, we have: 


FP’ in Fay = Rye (1) 
If both sides of equation (1) be multiplied on the left by F,,;, we have 
F nj Pi Frnt — F'nj Riz . (2) 


* Thurstone, L. L. Vectors of Mind. Chicago: University of Chicago Press, 


1935, Ch. VI. 
+ Dwyer, Paul F. “The determination of the factor loadings of a given test 


from the known factor loadings of other tests,” Psychometrika, 2, 1987, 173-179. 
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Now, under the conditions given in the definition of F'‘»;, the matrix 
product F',;F’jm is non-singular and has an inverse, whence: 


Ft = (F'mj P" sa) * Fnj Rj: . (3) 
If we define 
Fnj F" im == Gnu ’ (4) 
then (3) becomes 
Frnt = =G} i ms Rit. (5) 


Now if, as is the case with weil or principal axis factor load- 
ings, all non-diagonal elements of Gn are sensibly zero, i.e., 





Inu =0, (mM), (6) 
then Gny is a diagonal matrix, and we may write: 
Cn = Dinan »where dina =0 ’ dinm = > mj (7) 
j=l 
whence 
| 1 
Gi =D =| 
mM mm | n (8) 
| > mj 
j=1 


If what are ultimately desired are not the loadings on an arbi- 
trary set of centroid axes, but the primary trait loadings, v:p, where 


Vip S Vtp| = F'mm T'mp ’ (9) 


(T np being a transformation matrix, determined i in the previous analy- 
sis of the n tests) then 


Vip = R’:; ee —. lw > (10) 


In the factorial situation, F’;,, is known from previous analysis, 
Gnu is given from a knowledge of F’;,,, and G1, may be computed by 
Aitkens’* procedure or by Tucker’s modification,+ if equation (6) 
does not hold, or from equation (8) if (6) and (7) be true. Tn) is 
known from the previous analysis, and V;, may be computed directly 
from equation (10). 

If, as may frequently be the case, there are more additional tests 
than there are factors (s > 7), then it will be computationally simpler 
to define a new matrix. 


Omp = G2, T mp (11) 


* Thompson, G. H., “Some points of mathematical / i in the factorial 
analysis of ability,” Jour. Educ. Psychol., 2'7, 1986, 37- 

+ Tucker, L. R. “A method for finding the inverse a: a matrix,’? Psychomet- 
rika, 8, 1938, 189-197. 
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and to compute V;, from 
Vip ia Rj FP’ im Ump . (12) 


If, as may well happen, it is desired to modify the resulting sys- 
tem of primary traits in the light of the primary trait loadings of the 
additional tests, this may be accomplished by the use of any of the 


standard procedures. 
CHARLES I. MOSIER 


University of Florida. 
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